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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Gallatin, W. Michael 
Kilgannon, Patrick D. 

(ii) TITLE OF INVENTION: ICAM-4 Materials and Methods 

(iii) NUMBER OF SEQUENCES: 42 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Marshall, O' Toole, Gerstein, Murray & Borun 

(B) STREET: 233 South Wacker Drive, 6300 Sears Tower 

(C) CITY: Chicago 

(D) STATE: Illinois 

(E) COUNTRY: United States of America 

(F) ZIP: 60606-6402 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/827,689 

(B) FILING DATE: 27 -JAN- 1992 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/889,724 

(B) FILING DATE: 26-MAY-1992 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/894,061 

(B) FILING DATE: 05-JUN-1992 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/009,266 

(B) FILING DATE: 22 -JAN- 1993 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/102,852 

(B) FILING DATE: 05 -AUG -19 93 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/24 5,295 

(B) FILING DATE: 18-MAY-1994 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/485,604 

(B) FILING DATE: 07-JUN-1995 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: WILLIAMS, JR. JOSEPH A. 

(B) REGISTRATION NUMBER: 38,659 

(C) REFERENCE /DOCKET NUMBER: 27866/33321 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 312-474-6300 

(B) TELEFAX: 312-474-0448 

(C) TELEX: 25-3856 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 88 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 61.. 2814 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

AATTCGATCA CTCGCGCTCC CCTCGCCTTC TGCGCTCTCC CCTCCCTGGC AGCGGCGGCA 

ATG CCG GGG CCT TCA CCA GGG CTG CGC CGA ACG CTC CTC GGC CTC TGG 
Met Pro Gly Pro Ser Pro Gly Leu Arg Arg Thr Leu Leu Gly Leu Trp 
15 10 15 

GCT GCC CTG GGC CTG GGG ATC CTA GGC ATC TCA GCG GTC GCG CTA GAA 
Ala Ala Leu Gly Leu Gly lie Leu Gly He Ser Ala Val Ala Leu Glu 
20 25 30 

CCT TTC TGG GCG GAC CTT CAG CCC CGC GTG GCG CTC GTG GAG CGC GGG 
Pro Phe Trp Ala Asp Leu Gin Pro Arg Val Ala Leu Val Glu Arg Gly 
35 40 45 

GGC TCG CTG TGG CTC AAC TGC AGC ACT AAC TGT CCG AGG CCG GAG CGC 
Gly Ser Leu Trp Leu Asn Cys Ser Thr Asn Cys Pro Arg Pro Glu Arg 
50 55 60 

GGT GGC CTG GAG ACC TCG CTA CGC CGA AAC GGG ACC CAG AGG GGT CTG 
Gly Gly Leu Glu Thr Ser Leu Arg Arg Asn Gly Thr Gin Arg Gly Leu 
65 70 75 80 

CGC TGG CTG GCT CGA CAG CTG GTG GAC ATC CGA GAG CCT GAA ACC CAG 
Arg Trp Leu Ala Arg Gin Leu Val Asp He Arg Glu Pro Glu Thr Gin 
85 90 95 

CCG GTC TGC TTC TTC CGC TGC GCG CGC CGC ACA CTC CAA GCG CGT GGG 
Pro Val Cys Phe Phe Arg Cys Ala Arg Arg Thr Leu Gin Ala Arg Gly 
100 105 " 110 

CTC ATC CGA ACT TTC CAG CGA CCG GAT CGG GTA GAG CTA GTG CCT CTG 
Leu He Arg Thr Phe Gin Arg Pro Asp Arg Val Glu Leu Val Pro Leu 
115 120 125 

CCT CCT TGG CAG CCT GTA GGT GAG AAC TTC ACC TTG AGC TGC AGG GTC 
Pro Pro Trp Gin Pro Val Gly Glu Asn Phe Thr Leu Ser Cys Arg Val 
130 135 140 

CCG GGG GCA GGA CCC CGA GCG AGC CTC ACA TTG ACC TTG CTG CGA GGC 
Pro Gly Ala Gly Pro Arg Ala Ser Leu Thr Leu Thr Leu Leu Arg Gly 
145 150 155 160 
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GGC CAG GAG CTG ATT CGC CGA AGT TTC GTA GGC GAG CCA CCC CGA GCT 588 
Gly Gin Glu Leu lie Arg Arg Ser Phe Val Gly Glu Pro Pro Arg Ala 
165 170 175 

CGG GGT GCG ATG CTC ACC GCC ACG GTC CTG GCG CGC AGA GAG GAT CAC 636 
Arg Gly Ala Met Leu Thr Ala Thr Val Leu Ala Arg Arg Glu Asp His 
180 185 190 

AGG GCC AAT TTC TCA TGC CTC GCG GAG CTT GAC CTG CGG CCA CAC GGC 684 
Arg Ala Asn Phe Ser Cys Leu Ala Glu Leu Asp Leu Arg Pro His Gly 
195 200 205 

TTG GGA CTG TTT GCA AAC AGC TCA GCC CCC AGA CAG CTC CGC ACG TTT 732 
Leu Gly Leu Phe Ala Asn Ser Ser Ala Pro Arg Gin Leu Arg Thr Phe 
210 215 220 

GCC ATG CCT CCA CTT TCC CCG AGC CTT ATT GCC CCA CGA TTC TTA GAA 780 
Ala Met Pro Pro Leu Ser Pro Ser Leu lie Ala Pro Arg Phe Leu Glu 
225 230 235 240 

GTG GGC TCA GAA AGG CCG GTG ACT TGC ACT TTG GAT GGA CTG TTT CCT 828 
Val Gly Ser Glu Arg Pro Val Thr Cys Thr Leu Asp Gly Leu Phe Pro 
245 250 ' 255 

GCC CCA GAA GCC GGG GTT TAC CTC TCT CTG GGA GAT CAG AGG CTT CAT 876 
Ala Pro Glu Ala Gly Val Tyr Leu Ser Leu Gly Asp Gin Arg Leu His 
260 265 270 

CCT AAT GTG ACC CTC GAC GGG GAG AGC CTT GTG GCC ACT GCC ACA GCT 924 
Pro Asn Val Thr Leu Asp Gly Glu Ser Leu Val Ala Thr Ala Thr Ala 
275 280 285 

ACA GCA AGT GAA GAA CAG GAA GGC ACC AAA CAG CTG ATG TGC ATC GTG 9 72 

Thr Ala Ser Glu Glu Gin Glu Gly Thr Lys Gin Leu Met Cys lie Val 
290 295 300 

ACC CTC GGG GGC GAA AGC AGG GAG ACC CAG GAA AAC CTG ACT GTC TAC 1020 
Thr Leu Gly Gly Glu Ser Arg Glu Thr Gin Glu Asn Leu Thr Val Tyr 
305 310 315 320 

AGC TTC CCG GCT CCT CTT CTG ACT TTA AGT GAG CCA GAA GCC CCC GAG 1068 
Ser Phe Pro Ala Pro Leu Leu Thr Leu Ser Glu Pro Glu Ala Pro Glu 
325 330 335 

GGA AAG ATG GTG ACC GTA AGC TGC TGG GCA GGG GCC CGA GCC CTT GTC 1116 
Gly Lys Met Val Thr Val Ser Cys Trp Ala Gly Ala Arg Ala Leu Val 
340 345 350 

ACC TTG GAG GGA ATT CCA GCT GCG GTC CCT GGG CAG CCC GCT GAG CTC 1164 
Thr Leu Glu Gly He Pro Ala Ala Val Pro Gly Gin Pro Ala Glu Leu 
355 360 365 

CAG TTA AAT GTC ACA AAG AAT GAC GAC AAG CGG GGC TTC TTC TGC GAC 1212 
Gin Leu Asn Val Thr Lys Asn Asp Asp Lys Arg Gly Phe Phe Cys Asp 
370 375 380 

GCT GCC CTC GAT GTG GAC GGG GAA ACT CTG AGA AAG AAC CAG AGC TCT 1260 
Ala Ala Leu Asp Val Asp Gly Glu Thr Leu Arg Lys Asn Gin Ser Ser 
385 390 395 400 

GAG CTT CGT GTT CTG TAC GCA CCT CGG CTG GAT GAC TTG GAC TGT CCC 1308 
Glu Leu Arg Val Leu Tyr Ala Pro Arg Leu Asp Asp Leu Asp Cys Pro 
405 410 * 415 
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AGG AGC TGG ACG TGG CCA GAG GGT CCA GAG CAG ACC CTC CAC TGC GAG 
Arg Ser Trp Thr Trp Pro Glu Gly Pro Glu Gin Thr Leu His Cys Glu 
420 425 430 

GCC CGT GGA AAC CCT GAG CCC TCC GTG CAC TGT GCA AGG CCT GAC GGT 
Ala Arg Gly Asn Pro Glu Pro Ser Val His Cys Ala Arg Pro Asp Gly 
435 440 445 

GGG GCG GTG CTA GCG CTG GGC CTG TTG GGT CCA GTG ACC CGT GCC CTC 
Gly Ala Val Leu Ala Leu Gly Leu Leu Gly Pro Val Thr Arg Ala Leu 
450 455 * 460 

GCG GGC ACT TAC CGA TGT ACA GCA ATC AAT GGG CAA GGC CAG GCG GTC 
Ala Gly Thr Tyr Arg Cys Thr Ala He Asn Gly Gin Gly Gin Ala Val 
465 470 475 480 

AAG GAT GTG ACC CTG ACT GTG GAA TAT GCC CCA GCG CTG GAC AGT GTA 
Lys Asp Val Thr Leu Thr Val Glu Tyr Ala Pro Ala Leu Asp Ser Val 
485 490 495 

GGC TGC CCA GAA CGT ATT ACT TGG CTG GAG GGG ACA GAG GCA TCG CTT 
Gly Cys Pro Glu Arg He Thr Trp Leu Glu Gly Thr Glu Ala Ser Leu 
500 505 510 

AGC TGT GTG GCA CAC GGG GTC CCA CCA CCT AGC GTG AGC TGT GTG CGC 
Ser Cys Val Ala His Gly Val Pro Pro Pro Ser Val Ser Cys Val Arg 
515 520 525 

TCT GGA AAG GAG GAA GTC ATG GAA GGG CCC CTG CGT GTG GCC CGG GAG 
Ser Gly Lys Glu Glu Val Met Glu Gly Pro Leu Arg Val Ala Arg Glu 
530 535 540 

CAC GCT GGC ACT TAC CGA TGC GAA GCC ATC AAC GCC AGG GGA TCA GCG 
His Ala Gly Thr Tyr Arg Cys Glu Ala He Asn Ala Arg Gly Ser Ala 
545 550 555 560 

GCC AAA AAT GTG GCT GTC ACG GTG GAA TAT GGT CCC AGT TTT GAG GAG 
Ala Lys Asn Val Ala Val Thr Val Glu Tyr Gly Pro Ser Phe Glu Glu 
565 570 575 

TTG GGC TGC CCC AGC AAC TGG ACT TGG GTA GAA GGA TCT GGA AAA CTG 
Leu Gly Cys Pro Ser Asn Trp Thr Trp Val Glu Gly Ser Gly Lys Leu 
580 585 590 

TTT TCC TGT GAA GTT GAT GGG AAG CCG GAA CCA CGC GTG GAG TGC GTG 
Phe Ser Cys Glu Val Asp Gly Lys Pro Glu Pro Arg Val Glu Cys Val 
595 600 605 

GGC TCG GAG GGT GCA AGC GAA GGG GTA GTG TTG CCC CTG GTG TCC TCG 
Gly Ser Glu Gly Ala Ser Glu Gly Val Val Leu Pro Leu Val Ser Ser 
610 G15 620 

AAC TCT GGT TCC AGA AAC TCT ATG ACT CCT GGT AAC CTG TCA CCG GGT 
Asn Ser Gly Ser Arg Asn Ser Met Thr Pro Gly Asn Leu Ser Pro Gly 
625 630 635 640 

ATT TAC CTC TGC AAC GCC ACC AAC CGG CAT GGC TCC ACA GTC AAA ACA 
He Tyr Leu Cys Asn Ala Thr Asn Arg His Gly Ser Thr Val Lys Thr 
645 650 655 

GTC GTC GTG AGC GCG GAA TCA CCG CCA CAG ATG GAT GAA TCC AGT TGC 
Val Val Val Ser Ala Glu Ser Pro Pro Gin Met Asp Glu Ser Ser Cys 
660 665 670 
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CCG AGT CAC CAG ACA TGG CTG GAA GGA GCC GAG GCT ACT GCG CTG GCC 
Pro Ser His Gin Thr Trp Leu Glu Gly Ala Glu Ala Thr Ala Leu Ala 
675 680 685 

TGC AGT GCC AGA GGC CGC CCC TCT CCA CGC GTG CGC TGT TCC AGG GAA 
Cys Ser Ala Arg Gly Arg Pro Ser Pro Arg Val Arg Cys Ser Arg Glu 
690 695 700 

GGT GCA GCC AGG CTG GAG AGG CTA CAG GTG TCC CGA GAG GAT GCG GGG 
Gly Ala Ala Arg Leu Glu Arg Leu Gin Val Ser Arg Glu Asp Ala Gly 
705 710 715 720 

ACC TAC CTG TGT GTG GCT ACC AAC GCG CAT GGC ACG GAT TCA CGG ACC 
Thr Tyr Leu Cys Val Ala Thr Asn Ala His Gly Thr Asp Ser Arg Thr 
725 730 735 

GTC ACT GTG GGT GTG GAA TAC CGG CCT GTG GTG GCT GAG CTG GCA GCC 
Val Thr Val Gly Val Glu Tyr Arg Pro Val Val Ala Glu Leu Ala Ala 
740 745 750 

TCG CCC CCA AGC GTG CGG CCT GGC GGA AAC TTC ACT CTG ACC TGC CGT 
Ser Pro Pro Ser Val Arg Pro Gly Gly Asn Phe Thr Leu Thr Cys Arg 
755 760 765 

GCA GAG GCC TGG CCT CCA GCC CAG ATC AGC TGG CGC GCG CCC CCG GGA 
Ala Glu Ala Trp Pro Pro Ala Gin lie Ser Trp Arg Ala Pro Pro Gly 
770 775 780 

GCT CTC AAC CTC GGT CTC TCC AGC AAC AAC AGC ACG CTG AGC GTG GCG 
Ala Leu Asn Leu Gly Leu Ser Ser Asn Asn Ser Thr Leu Ser Val Ala 
785 790 795 800 

GGT GCC ATG GGC AGC CAT GGT GGC GAG TAT GAG TGC GCA GCC ACC AAT 
Gly Ala Met Gly Ser His Gly Gly Glu Tyr Glu Cys Ala Ala Thr Asn 
805 810 815 

GCG CAT GGG CGC CAC GCA CGG CGC ATC ACG GTG CGC GTG GCC GGT CCA 
Ala His Gly Arg His Ala Arg Arg He Thr Val Arg Val Ala Gly Pro 
820 825 830 

TGG CTG TGG GTC GCT GTG GGC GGT GCG GCA GGG GGC GCG GCG CTG CTG 
Trp Leu Trp Val Ala Val Gly Gly Ala Ala Gly Gly Ala Ala Leu Leu 
835 840 845 

GCC GCA GGG GCC GGC CTG GCC TTC TAC GTG CAG TCC ACC GCT TGC AAG 
Ala Ala Gly Ala Gly Leu Ala Phe Tyr Val Gin Ser Thr Ala Cys Lys 
850 855 860 

AAG GGA GAG TAC AAC GTC CAG GAG GCT GAG AGC TCA GGC GAG GCG GTG 
Lys Gly Glu Tyr. Asn Val Gin Glu Ala Glu Ser Ser Gly Glu Ala Val 
865 870 875 ' 880 

TGT CTC AAT GGC GCG GGC GGG ACA CCG GGT GCA GAA GGC GGA GCA GAG 
Cys Leu Asn Gly Ala Gly Gly Thr Pro Gly Ala Glu Gly Gly Ala Glu 
885 890 895 

ACC CCC GGC ACT GCC GAG TCA CCT GCA GAT GGC GAG GTT TTC GCC ATC 
Thr Pro Gly Thr Ala Glu Ser Pro Ala Asp Gly Glu Val Phe Ala He 
900 905 910 

CAG CTG ACA TCT TCC TGAGCCTGTA TCCAGCTCCC CCAGGGGCCT CGAAAGCACA 
Gin Leu Thr Ser Ser 
915 
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GGGGTGGACG TATGTATTGT TCACTCTCTA TTTATTCAAC TCCAGGGGCG TCGTCCCCGT 2911 
TTTCTACCCA TTCCCTTAAT AAAGTTTTTA TAGGAGAAAA AAAAAAAAAA AAAAAAAAAA 2971 
AAAAAAAAAA AAAAAAA 2988 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 917 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Pro Gly Pro Ser Pro Gly Leu Arg Arg Thr Leu Leu Gly Leu Trp 
15 10 15 

Ala Ala Leu Gly Leu Gly lie Leu Gly lie Ser Ala Val Ala Leu Glu 
20 25 30 

Pro Phe Trp Ala Asp Leu Gin Pro Arg Val Ala Leu Val Glu Arg Gly 
35 40 45 

Gly Ser Leu Trp Leu Asn Cys Ser Thr Asn Cys Pro Arg Pro Glu Arg 
50 * 55 60 

Gly Gly Leu Glu Thr Ser Leu Arg Arg Asn Gly Thr Gin Arg Gly Leu 
65 70 75 80 

Arg Trp Leu Ala Arg Gin Leu Val Asp lie Arg Glu Pro Glu Thr Gin 
85 90 95 

Pro Val Cys Phe Phe Arg Cys Ala Arg Arg Thr Leu Gin Ala Arg Gly 
100 105 110 

Leu lie Arg Thr Phe Gin Arg Pro Asp Arg Val Glu Leu Val Pro Leu 
115 120 125 

Pro Pro Trp Gin Pro Val Gly Glu Asn Phe Thr Leu Ser Cys Arg Val 
130 135 140 

Pro Gly Ala Gly Pro Arg Ala Ser Leu Thr Leu Thr Leu Leu Arg Gly 
145 150 155 ISO 

Gly Gin Glu Leu lie Arg Arg Ser Phe Val Gly Glu Pro Pro Arg Ala 
165 170 175 

Arg Gly Ala Met Leu Thr Ala Thr Val Leu Ala Arg Arg Glu Asp His 
180 185 190 

Arg Ala Asn Phe Ser Cys Leu Ala Glu Leu Asp Leu Arg Pro His Gly 
195 200 205 

Leu Gly Leu Phe Ala Asn Ser Ser Ala Pro Arg Gin Leu Arg Thr Phe 
210 215 220 

Ala Met Pro Pro Leu Ser Pro Ser Leu lie Ala Pro Arg Phe Leu Glu 
225 230 235 240 
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Val Gly Ser Glu Arg Pro Val Thr Cys Thr Leu Asp Gly Leu Phe Pro 
245 250 255 

Ala Pro Glu Ala Gly Val Tyr Leu Ser Leu Gly Asp Gin Arg Leu His 
260 265 270 

Pro Asn Val Thr Leu Asp Gly Glu Ser Leu Val Ala Thr Ala Thr Ala 
275 280 285 

Thr Ala Ser Glu Glu Gin Glu Gly Thr Lys Gin Leu Met Cys He Val 
290 295 300 

Thr Leu Gly Gly Glu Ser Arg Glu Thr Gin Glu Asn Leu Thr Val Tyr 
305 " 310 315 320 

Ser Phe Pro Ala Pro Leu Leu Thr Leu Ser Glu Pro Glu Ala Pro Glu 
325 330 335 

Gly Lys Met Val Thr Val Ser Cys Trp Ala Gly Ala Arg Ala Leu Val 
340 345 350 

Thr Leu Glu Gly He Pro Ala Ala Val Pro Gly Gin Pro Ala Glu Leu 
355 360 365 

Gin Leu Asn Val Thr Lys Asn Asp Asp Lys Arg Gly Phe Phe Cys Asp 
370 375 380 

Ala Ala Leu Asp Val Asp Gly Glu Thr Leu Arg Lys Asn Gin Ser Ser 
385 390 395 400 

Glu Leu Arg Val Leu Tyr Ala Pro Arg Leu Asp Asp Leu Asp Cys Pro 
405 410 415 

Arg Ser Trp Thr Trp Pro Glu Gly Pro Glu Gin Thr Leu His Cys Glu 
420 425 430 

Ala Arg Gly Asn Pro Glu Pro Ser Val His Cys Ala Arg Pro Asp Gly 
435 440 " 445 

Gly Ala Val Leu Ala Leu Gly Leu Leu Gly Pro Val Thr Arg Ala Leu 

450 455 460 

Ala Gly Thr Tyr Arg Cys Thr Ala He Asn Gly Gin Gly Gin Ala Val 
465 470 475 480 

Lys Asp Val Thr Leu Thr Val Glu Tyr Ala Pro Ala Leu Asp Ser Val 
485 490 495 

Gly Cys Pro Glu Arg He Thr Trp Leu Glu Gly Thr Glu Ala Ser Leu 
500 505 * 510 

Ser Cys Val Ala His Gly Val Pro Pro Pro Ser Val Ser Cys Val Arg 
515 520 525 

Ser Gly Lys Glu Glu Val Met Glu Gly Pro Leu Arg Val Ala Arg Glu 
530 535 540 

His Ala Gly Thr Tyr Arg Cys Glu Ala He Asn Ala Arg Gly Ser Ala 
545 550 555 560 

Ala Lys Asn Val Ala Val Thr Val Glu Tyr Gly Pro Ser Phe Glu Glu 
565 570 575 
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Leu Gly Cys Pro Ser Asn Trp Thr Trp Val Glu Gly Ser Gly Lys Leu 
580 585 590 

Phe Ser Cys Glu Val Asp Gly Lys Pro Glu Pro Arg Val Glu Cys Val 
595 600 605 

Gly Ser Glu Gly Ala Ser Glu Gly Val Val Leu Pro Leu Val Ser Ser 
610 615 620 

Asn Ser Gly Ser Arg Asn Ser Met Thr Pro Gly Asn Leu Ser Pro Gly 
625 630 635 640 

lie Tyr Leu Cys Asn Ala Thr Asn Arg His Gly Ser Thr Val Lys Thr 
645 650 655 

Val Val Val Ser Ala Glu Ser Pro Pro Gin Met Asp Glu Ser Ser Cys 
660 665 670 

Pro Ser His Gin Thr Trp Leu Glu Gly Ala Glu Ala Thr Ala Leu Ala 
675 680 685 

Cys Ser Ala Arg Gly Arg Pro Ser Pro Arg Val Arg Cys Ser Arg Glu 
690 695 700 

Gly Ala Ala Arg Leu Glu Arg Leu Gin Val Ser Arg Glu Asp Ala Gly 
705 710 715 720 

Thr Tyr Leu Cys Val Ala Thr Asn Ala His Gly Thr Asp Ser Arg Thr 
725 730 735 

Val Thr Val Gly Val Glu Tyr Arg Pro Val Val Ala Glu Leu Ala Ala 
740 745 750 

Ser Pro Pro Ser Val Arg Pro Gly Gly Asn Phe Thr Leu Thr Cys Arg 
755 760 765 

Ala Glu Ala Trp Pro Pro Ala Gin lie Ser Trp Arg Ala Pro Pro Gly 
770 775 • 780 

Ala Leu Asn Leu Gly Leu Ser Ser Asn Asn Ser Thr Leu Ser Val Ala 
785 790 795 800 

Gly Ala Met Gly Ser His Gly Gly Glu Tyr Glu Cys Ala Ala Thr Asn 
805 * " 810 ' 815 

Ala His Gly Arg His Ala Arg Arg He Thr Val Arg Val Ala Gly Pro 
820 825 830 

Trp Leu Trp Val Ala Val Gly Gly Ala Ala Gly Gly Ala Ala Leu Leu 
835 840 845 

Ala Ala Gly Ala Gly Leu Ala Phe Tyr Val Gin Ser Thr Ala Cys Lys 
850 855 860 

Lys Gly Glu Tyr Asn Val Gin Glu Ala Glu Ser Ser Gly Glu Ala Val 
865 870 875 880 

Cys Leu Asn Gly Ala Gly Gly Thr Pro Gly Ala Glu Gly Gly Ala Glu 
885 890 895 

Thr Pro Gly Thr Ala Glu Ser Pro Ala Asp Gly Glu Val Phe Ala He 
900 905 910 
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Gin Leu Thr Ser Ser 
915 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..315 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

CCG GAT CGG GTA GAG CTA GTG CCT CTG CCT CCT TGG CAG CCT GTA GGT 
Pro Asp Arg Val Glu Leu Val Pro Leu Pro Pro Trp Gin Pro Val Gly 
15 10 15 

GAG AAC TTC ACC TTG AGC TGC AGG GTC CCG GGG GCA GGA CCC CGA GCG 
Glu Asn Phe Thr Leu Ser Cys Arg Val Pro Gly Ala Gly Pro Arg Ala 
20 25 30 

AGC CTC ACA TTG ACC TTG CTG CGA GGC GGA CAG GAG CTG ATT CGC CGA 
Ser Leu Thr Leu Thr Leu Leu Arg Gly Gly Gin Glu Leu lie Arg Arg 
35 40 45 

AGT TTC GTA GGC GAG CCA CCC CGA GCT CGG TGT GCG ATG CTC ACC GCC 
Ser Phe Val Gly Glu Pro Pro Arg Ala Arg Cys Ala Met Leu Thr Ala 
50 55 ~ 60 

ACG GTC CTG GCG CGC AGA GAG GAT CAC AGG GAC AAT TTC TCA TGC CTC 
Thr Val Leu Ala Arg Arg Glu Asp His Arg Asp Asn Phe Ser Cys Leu 
65 70 75 80 

GCG GAG CTT GAC CTG CGG ACA CAC GGC TTG GGA CTG TTT GCA AAC AGC 
Ala Glu Leu Asp Leu Arg Thr His Gly Leu Gly Leu Phe Ala Asn Ser 
85 " 90 95 

TCA GCC CCC AGA CAG CTC CGC ACG TTT 
Ser Ala Pro Arg Gin Leu Arg Thr Phe 
100 105 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1781 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 16.. 1659 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
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CAGCTCTCTG TCAGA ATG GCC ACC ATG GTA CCA TCC GTG TTG TGG CCC AGG 
Met Ala Thr Met Val Pro Ser Val Leu Trp Pro Arg 
15 10 

GCC TGC TGG ACT CTG CTG GTC TGC TGT CTG CTG ACC CCA GGT GTC CAG 
Ala Cys Trp Thr Leu Leu Val Cys Cys Leu Leu Thr Pro Gly Val Gin 
15 20 25 

GGG CAG GAG TTC CTT TTG CGG GTG GAG CCC CAG AAC CCT GTG CTC TCT 
Gly Gin Glu Phe Leu Leu Arg Val Glu Pro Gin Asn Pro Val Leu Ser 
30 35 40 

GCT GGA GGG TCC CTG TTT GTG AAC TGC AGT ACT GAT TGT CCC AGC TCT 
Ala Gly Gly Ser Leu Phe Val Asn Cys Ser Thr Asp Cys Pro Ser Ser 
45 50 55 60 

GAG AAA ATC GCC TTG GAG ACG TCC CTA TCA AAG GAG CTG GTG GCC AGT 
Glu Lys lie Ala Leu Glu Thr Ser Leu Ser Lys Glu Leu Val Ala Ser 
65 70 75 

GGC ATG GGC TGG GCA GCC TTC AAT CTC AGC AAC GTG ACT GGC AAC AGT 
Gly Met Gly Trp Ala Ala Phe Asn Leu Ser Asn Val Thr Gly Asn Ser 
80 85 90 

CGG ATC CTC TGC TCA GTG TAC TGC AAT GGC TCC CAG ATA ACA GGC TCC 
Arg lie Leu Cys Ser Val Tyr Cys Asn Gly Ser Gin lie Thr Gly Ser 
95 100 105 

TCT AAC ATC ACC GTG TAC GGG CTC CCG GAG CGT GTG GAG CTG GCA CCC 
Ser Asn lie Thr Val Tyr Gly Leu Pro Glu Arg Val Glu Leu Ala Pro 
110 115 120 

CTG CCT CCT TGG CAG CCG GTG GGC CAG AAC TTC ACC CTG CGC TGC CAA 
Leu Pro Pro Trp Gin Pro Val Gly Gin Asn Phe Thr Leu Arg Cys Gin 
125 130 135 140 

GTG GAG GGT GGG TCG CCC CGG ACC AGC CTC ACG GTG GTG CTG CTT CGC 
Val Glu Gly Gly Ser Pro Arg Thr Ser Leu Thr Val Val Leu Leu Arg 
145 150 155 

TGG GAG GAG GAG CTG AGC CGG CAG CCC GCA GTG GAG GAG CCA GCG GAG 
Trp Glu Glu Glu Leu Ser Arg Gin Pro Ala Val Glu Glu Pro Ala Glu 
160 165 170 

GTC ACT GCC ACT GTG CTG GCC AGC AGA GAC GAC CAC GGA GCC CCT TTC 
Val Thr Ala Thr Val Leu Ala Ser Arg Asp Asp His Gly Ala Pro Phe 
175 180 185 

TCA TGC CGC ACA GAA CTG GAC ATG CAG CCC CAG GGG CTG GGA CTG TTC 
Ser Cys Arg Thr Glu Leu Asp Met Gin Pro Gin Gly Leu Gly Leu Phe 
190 195 200 

GTG AAC ACC TCA GCC CCC CGC CAG CTC CGA ACC TTT GTC CTG CCC GTG 
Val Asn Thr Ser Ala Pro Arg Gin Leu Arg Thr Phe Val Leu Pro Val 
205 210 215 220 

ACC CCC CCG CGC CTC GTG GCC CCC CGG TTC TTG GAG GTG GAA ACG TCG 
Thr Pro Pro Arg Leu Val Ala Pro Arg Phe Leu Glu Val Glu Thr Ser 
225 230 235 

TGG CCG GTG GAC TGC ACC CTA GAC GGG CTT TTT CCA GCC TCA GAG GCC 
Trp Pro Val Asp Cys Thr Leu Asp Gly Leu Phe Pro Ala Ser Glu Ala 
240 245 250 
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CAG GTC TAC CTG GCG CTG GGG GAC CAG ATG CTG AAT GCG ACA GTC ATG 819 
Gin Val Tyr Leu Ala Leu Gly Asp Gin Met Leu Asn Ala Thr Val Met 
255 260 265 

AAC CAC GGG GAC ACG CTA ACG GCC ACA GCC ACA GCC ACG GCG CGC GCG 867 
Asn His Gly Asp Thr Leu Thr Ala Thr Ala Thr Ala Thr Ala Arg Ala 
270 275 280 

GAT CAG GAG GGT GCC CGG GAG ATC GTC TGC AAC GTG ACC CTA GGG GGC 915 
Asp Gin Glu Gly Ala Arg Glu lie Val Cys Asn Val Thr Leu Gly Gly 
285 ' 290 295 300 

GAG AGA CGG GAG GCC CGG GAG AAC TTG ACG GTC TTT AGC TTC CTA GGA 963 
Glu Arg Arg Glu Ala Arg Glu Asn Leu Thr Val Phe Ser Phe Leu Gly 
305 310 315 

CCC ATT GTG AAC CTC AGC GAG CCC ACC GCC CAT GAG GGG TCC ACA GTG 1011 
Pro lie Val Asn Leu Ser Glu Pro Thr Ala His Glu Gly Ser Thr Val 
320 325 330 

ACC GTG AGT TGC ATG GCT GGG GCT CGA GTC CAG GTC ACG CTG GAC GGA 1059 
Thr Val Ser Cys Met Ala Gly Ala Arg Val Gin Val Thr Leu Asp Gly 
335 340 345 

GTT CCG GCC GCG GCC CCG GGG CAG ACA GCT CAA CTT CAG CTA AAT GCT 1107 
Val Pro Ala Ala Ala Pro Gly Gin Thr Ala Gin Leu Gin Leu Asn Ala 
350 355 360 

ACC GAG AGT GAC GAC GGA CGC AGC TTC TTC TGC AGT GCC ACT CTC GAG 1155 
Thr Glu Ser Asp Asp Gly Arg Ser Phe Phe Cys Ser Ala Thr Leu Glu 
365 " 370 375 380 

GTG GAC GGC GAG TTC TTG CAC AGG AAC AGT AGC GTC CAG CTG CGA GTC 1203 
Val Asp Gly Glu Phe Leu His Arg Asn Ser Ser Val Gin Leu Arg Val 
385 390 395 

CTG TAT GGT CCC AAA ATT GAC CGA GCC ACA TGC CCC CAG CAC TTG AAA 1251 
Leu Tyr Gly Pro Lys lie Asp Arg Ala Thr Cys Pro Gin His Leu Lys 
400 405 410 

TGG AAA GAT AAA ACG AGA CAC GTC CTG CAG TGC CAA GCC AGG GGC AAC 1299 
Trp Lys Asp Lys Thr Arg His Val Leu Gin Cys Gin Ala Arg Gly Asn 
415 420 425 

CCG TAC CCC GAG CTG CGG TGT TTG AAG GAA GGC TCC AGC CGG GAG GTG 1347 
Pro Tyr Pro Glu Leu Arg Cys Leu Lys Glu Gly Ser Ser Arg Glu Val 
430 435 440 

CCG GTG GGG ATC CCG TTC TTC GTC AAC GTA ACA CAT AAT GGT ACT TAT 1395 
Pro Val Gly He Pro Phe Phe Val Asn Val Thr His Asn Gly Thr Tyr 
445 450 455 460 

CAG TGC CAA GCG TCC AGC TCA CGA GGC AAA TAC ACC CTG GTC GTG GTG 1443 
Gin Cys Gin Ala Ser Ser Ser Arg Gly Lys Tyr Thr Leu Val Val Val 
465 470 475 

ATG GAC ATT GAG GCT GGG AGC TCC CAC TTT GTC CCC GTC TTC GTG GCG 1491 
Met Asp He Glu Ala Gly Ser Ser His Phe Val Pro Val Phe Val Ala 
480 485 490 

GTG TTA CTG ACC CTG GGC GTG GTG ACT ATC GTA CTG GCC TTA ATG TAC 1539 
Val Leu Leu Thr Leu Gly Val Val Thr He Val Leu Ala Leu Met Tyr 
495 500 505 



GTC TTC AGG GAG CAC CAA CGG AGC GGC AGT TAC CAT GTT AGG GAG GAG 1587 
Val Phe Arg Glu His Gin Arg Ser Gly Ser Tyr His Val Arg Glu Glu 
510 515 ' 520 

AGC ACC TAT CTG CCC CTC ACG TCT ATG CAG CCG ACA GAA GCA ATG GGG 1635 
Ser Thr Tyr Leu Pro Leu Thr Ser Met Gin Pro Thr Glu Ala Met Gly 
525 530 535 540 

GAA GAA CCG TCC AGA GCT GAG TGACGCTGGG ATCCGGGATC AAAGTTGGCG 1586 
Glu Glu Pro Ser Arg Ala Glu 
545 

GGGGCTTGGC TGTGCC CTCA GATTCCGCAC CAATAAAGC C TTCAAACTCC CAAAAAAAAA 1746 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAA 1781 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

CCGAACGCTC CTCGGCCTCT GGTCTNCTCT GGNCCTGGGG ATCCTAGGCA TCTCAGGTAA 60 

GAAGAGCCCG CCCGTGGAGC NAGGTGGATA AGGCGGGGGC GGAATTGAAG GACCAGAGAG 120 

GGCGGCCCGG GTGTCCCCCT CCAGGCTCCG CCCTCTTCTA GCTTCCCACG CTTCTGTCAC 180 

CACCTGGAGN TCGGGGCTTC TCCCCGTCCT TCCTCCACCC CAACACACCT CAATCTTTCA 24 0 

GANCTGAACC CAGCACCTTT TCTGGANTNG GGGNNTTGCA CCTAACCTGT CTCAGGAGAN 300 

ACTGTGGCTC TCCTGTCCTC TCCTGCTCTG TNATGCCCTA TGGTTCACAG ACTGGCATCA 360 

TCCCTATTCA TGATCCTCAA AGACNC CATC TCCTCAACTG TCATAACTCA GAGCTCTATT 420 

CCCCCTCCAC CTGGAGCCCT GGAAACCGGC TTTCTAGGGC TTTTCTC CGC GGTTCTTTCC 4 80 

CGGAGTTCAG CGTTGTGGCT TTTTGTCCAA GTTACTCAAG TTTGGGGACA ATCTCCTTTA 54 0 

AGCCTTTGAC TCAGTCTCAT TTC CACTTTG CTTTTGCCCC AAGCCTCTGT GTCTCTCCCC 600 

CATTTCCTGA CGATCTGTCA GAGTCTTAAG AGTGATTTGG TTCCCCATCC CCCCTCCAAC 660 

TGGAGTCTCC TCCTCACTAT TGATGTGTGC ATCTGAGACC CCCATCCCCG CACCGAGTTT 720 

CCCCATCTCT GTCAGTAAAG AGCAAGGCTT CCAGAGACAA CCCTCTAATA GCGCGTCAGT 780 

CCCGAATCTT GAGTGGGATG CGGGACTCCC GTGCTATTTC TTGGCGGAGG TCTTTCCTGG 840 

TCCTTATGGA CACCCCTGGT TTGGGATATG GGGGCCGCTA AGATTTCAGA GATGGGGTCC 900 

CTAGGCTGAG NCCGCGTTTT CCCGGGCAGC GGTCGCGCTA GAACCTTTCT GGGCGGACCT 960 

TCAGCCCCGC GTGGCGCTCG TGGAGCGCGG GGGCTCGCTG TGGCTCAACT GCAGCACTAA 1020 

CTGTCCGAGG CCGGAGCGCG GTGGCCTGGA GACCTCGCTA CGCCGAAACG GGACCCAGAG 1080 
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GGGTCTGNAC TGNCTGGCTC GACAGCTGGT GGACATCCGA GANCCTGAAA CCCAGCCGGT 1140 

CTGCTTCTTC CNCTGCGCGC GCCGCACACT CCAAGCGCGT GGGCTCATCC GAACTTTCCG 12 00 

TGAGTTCAGG GTGGGCACNC CCCTTGGGTC TCTGGACCTC CCCCTCAAGC TCCTCCCACC 1260 

CGCCCTCTGA TCCTCCTGCT TGTTCTGAAA GTACTACAGC TGGCTAGAGC GGAGTTTTTG 1320 

GTCCCTTGCA GAG CGACCGG ATCGGGTAGA GCTAGTGCCT CTGCCTCCTT GGCAGCCTGT 1380 

AGGTGAGAAC TTCACCTTGA GCTGCAGGGT CCCGGGGGCA GGACCCCGAG CGAGCCTCAC 144 0 

ATTGACCTTG CTGCGAGGCG GCCAGGAGCT GATTCGCCGA AGTTTCGTAG GCGAGCCACC 1500 

CCGAGCTCGG GGTGCGATGC TCACCGCCAC GGTCCTGGCG CGCAGAGAGG ATCACAGGGC 1560 

CAATTTCTCA TGCCTCGCGG AG CTTGACCT GCGNCCACAC GGCTTGGGAC TGTTTGCANA 1620 

CAGCTCAGCC CCCAGACAGC TCCGCACGTT TGGTGAGTGT GGACCCTAAC TGACAGATTT 1680 

TAAGAAGTTT AGGGCAGCCA GGCGTGGTGG CATGGTGTCG TAGGCCCTAA GTCCCAGCCC 174 0 

AAGCAGANCT AAGNCGGATC TCTTGTGAAT TAAAAGTCTA GCTCGTCTAC ATAACGAGGN 1800 

CTG CATAGTT AAATCCCCCA AAAGTCTAAG CAGCTAGCCC TTACTTCCAA CACAAGTACT 1860 

AGCTTAAGTA CTTTCTCCTG TGAG CTTTTT CCTTTATGTA TTTACTCGTT GAGAGAAAAA 1920 

GAGAGTGTGT GTACGTGCCT TTATGCACAT GCCGCAGTGC TTGTATGGAA GTTAAAGAAT 19 80 

AAGGAGGCGT TCTGCCCTTC CATCCTGTGG GTCCTAGGGG TGGTATTAGC TCCTCAGGCT 204 0 

TTGTTAGTNA CAAGCGCCTA GGCTTGGGGA GCCATCTCGC CCGCTCCTCT GTATCTTTAG 2100 

GGTGAAACCA GACAATGCAT GCAAATTGGT TGATCAACAC TGAATGTTTA GTTCGTAAAT 2160 

TCAAGCTCTG TTCTTTGTCT TCCTCAGCCA TGCCTCCACT TTCCCCCGAG CCTTATTGCC 2220 

CCACGATTCT TAGAAGTGGG CTCAGAAAGG CCGGTGACKT GCACTTTGGA TGGACTGTTT 2280 

CCTGCCCCAG AAGCCGGGGT TTACTTCTCT CTGGGAGATC AGAGGCTTCA TCCTAATGTG 234 0 

ACCCTCGACG GGGAGAGCCT TGTGGCCACT GCCACAGCTA CAG CAAGTGA AGAACAGGAA 2400 

GGCACCAAAC AG CTGATGTG CATCGTGACC CTCGGGGGCG AAAGCAGGGA GACCCAGGAA 24 60 

AACCTGACTG TCTACAGTAA GGGGAATCCA ACAAGACCTT CAATAG CTCA GACTGGGGCT 2520 

GGGGCTGGGT CTGGGTCTGG GGCCAGAGTC TCACAAAGGC GGAGCCTATA AAGTGGGCGG 2580 

GACCTCCACA C CAGAACAAG CCGGGCGGGA GAGTTCCAGG GCAGGAGCAG ATAGAAGTTG 2640 

GAAATTAATA GATTGGGTTG AGTTCCCTGA GTGGGGAGTG AACCCCACCC AATTCTCTGT 2 700 

CCCCAGGCTT CCCGGCTCCT CTTCTGACTT TAAGTGAGCC AGAAGCCCCC GAGGGAAAGA 2760 

TGGTGACCGT AAGCTGCTGG GCAGGGGCCC GAGCCCTTGT CACCTTGGAG GGAATTCCAA 2820 

GGACCCTCTT ACCGGCCCCA TCTTTAACCT TATCGTATCC CCTCTGCCTC ATGCCCGCAG 2880 

ACGCACCTCG GCTGGATGAC TTGGACTGTC CCAGGAGCTG GACGTGGCCA GAGGGTCCAG 2940 

AGCAGACCCT CCACTGCGAG GCCCGTGGAA ACCCTGAGCC CTCCGTGCAC TGTGCAAGGC 3000 
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CTGACGGTGG 


GGCGGTGCTA 


GCGCTGGGCC 


TGTTGGGTCC 


AGTGACCCGT 


GCCCTCGCGG 


3060 


GCACTTACCG 


ATGTACAGCA ATCAATGGGC 


AAGGCCAGGC 


GGTCAAGGAT 


GTGACCCTGA 


3120 


CTGTGGAATG 


TGAGTAGGGG 


GAGGTGGGCA 


TGCTTATCCC 


TTTAAGGTCA 


CGGAGTGTAC 


3180 


TGGGAGACTG 


GCTATACGGA AAGGAAAGAA 


GCCTAGGTTC 


AGCAGGGATT 


GGGAAAACAC 


3240 


TGAAGGAAAG 


TGGTGTGGTG 


TTTACAAACT 


TAACGGTGGT 


AACTGGGCAC 


GGTCTGGCAA 


3300 


AAACAGACAG 


CCAAGAGAGT 


GTGCCTGGGA 


AGCTGCAATG 


GGGGCTTTGT 


GGGAATTGGT 


3360 


CAACAGCACC 


CTGAGATCTC 


AGGAAAGGGG 


CCTGAAGTTA 


TCTCCAGAAC 


CCATGTGAAG 


3420 


GCAGGAAGAG 


AGAACGCCCA 


CCTTTTCCTG 


CTCCCCCCAA 


CCCCCCCCCA 


CATATCACAC 


3480 


GGAGTATATA 


AATAAATAAA 


ATGGCTCCTG 


CCGGAGGGAG 


TGAGAAGCTG 


TCTCCTGCAG 


3540 


GCTCAGAGCA 


GTGGTAGTGC 


ATGCCTTTAA 


TCCCAGCACT 


CGGTAGGCAA 


AGGCAGGCAG 


3600 


ATCTCTGTGA ATGTGGGGCC 


AGCCTGGTCT 


GTACAGAGAA 


ATC CTGTCTC 


AAAACAAACC 


3660 


AGCAAAGAAA 


CAAAACCAAA ATCAATTCCA 


GATGCCCCAG 


CGCTGGACAG 


TGTAGGCTGC 


3720 


CCANGACGTA 


TTACTTGNCT 


GGAGGGGACA 


GAGGCATCGC 


TTAGCTGTGT 


GGCACACGGG 


3780 


GTCCCACCAC 


CTAGCGTGAG 


CTGTGTGCGC 


TCTGGAAAGG 


AGGAAGTCAT 


GGAAGGGCCC 


3840 


CTGCGTGTGG 


CCCGGGAGCA 


CGCTGGCACT 


TACCGATGCG 


AAGCCATCAA 


CGCCAGGGGA 


3900 


TCAGCGGNCA AAAATGTGGC 


TGTCACGGTG 


GAATGTGAGT 


AGG GGTGGCT 


ACGGAAATGT 


3960 


CCACACCTGC 


GTCCTCTGTC 


CTCAGTGTGA 


ACTCCTATTT 


CCCTGCTTCC 


TAGATGGTCC 


4020 


CAGTTNTGAG 


GAGTTGGGCT 


GCCCCAGCAA 


CTGGACTTGG 


GTAGAAGGAT 


CTGGAAAACT 


4080 


GTTTTCCTGT 


GAAGTTGATG 


GGAAGCCGGA 


ACCACGCGTG 


GAGTGCGTGG 


GCTCGGAGGG 


4140 


TGCAAGCGAA 


GGGGTAGTGT 


TGCCCCTGGT 


GTCCTCGAAC 


TCTGGTTCCA 


GAAACTCTAT 


4200 


GACTCCTGGT 


AACCTGTCAC 


CGGGTATTTA 


CCTCTGCAAC 


GCCACCAACC 


GGCATGGCTC 


4260 


CACAGTCAAA 


ACAGTCGTCG 


TGAGCGCGGA 


ATGTGAGCAG 


GGGCCCAGGT 


GGGCGGAGAG 


4320 


TACCGGGTGT 


CCCAGGATCT 


TTTCTTTCCC 


TGATGCCCCT 


C CTTATGGTG 


GCTGATCTGC 


4380 


AGCACCGCCA 


CAGATGGATG 


AATCCAGTTG 


CCCGAGTCAC 


CAGACATGGC 


TGGAAGGAGC 


4440 


CGAGGCTACT 


GCGCTGGCCT 


GCAGTGACAG 


GGGNCGCCCC 


TCTCCACGCG 


TGCGCTGTTC 


4500 


CAGGGAAGGT 


GCAGCCAGGC 


TGGAGAGGCT ACAGGTGTCC 


CGAGAGGATG 


CGGGGACCTA 


4560 


CCTGTGTGTG 


GCTAC CAACG 


CGCATGGCAC 


GGATTCACGG 


AC CGTCACTG 


TGGGTGTGGA 


4620 


ATGTGAGTGA 


GGACAGCGCT 


GAATGAAGAC 


GACTCAGACC 


GC CAGAAAAG 


TGCCTTGAGG 


4680 


CCTGGGATGT 


ATGATCCAGT 


GGGTAGAGTG 


CTCAATTAGC 


AC TCACTAAA 


ATGTATATTC 


4740 


TATTCCTAAT 


ACTCTTTAAT 


TTTANCCTTT 


GGGAGGCAGA 


GACAGGCAGA 


TCTCTGTTCC 


4800 


GGGATAACCT 


GCTCTCTGTC 


TAGGACAGCT 


TGGTCTACAG 


AGGGGNTACA 


GGCCCCCCCT 


4860 


CCCAAGATTG 


NATAGCAACC 


CTCTGGCTCC 


CTGTCTCTCT 






4900 
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(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1295 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



NGAATTCCGG 


CGGATCGGGT 


AGAGCTAGTG 


CCTCTGCCTC 


CTTGGCAGCC 


TGTAGGTGAG 


60 


AACTTCACCT 


TGAGCTGCAG 


GGTCCCGGGG 


GCAGGACCCC 


GAGCGAGCCT 


CACATTGAC C 


120 


TTGCTGCGAG 


GCGGCCAGGA 


GCTGATTCGC 


CGAAGTTTCG 


TAGGCGAGCC 


ACCCCGAGCT 


180 


CGGGGTGCGA 


TGCTCACCGC 


CACGGTCCTG 


GCGCGCAGAG 


AGGATCACAG 


GGCCAATTTC 


240 


TCATGCCTCG 


CGGAGCTTGA 


CCTGCGGCCA 


CACGGCTTGG 


GACTGTTTGC 


AAACAGCTCA 


300 


GCCCCCAGAC 


AGCTCCGCAC 


GTTTGCCATG 


CCTCCACTTT 


CCCCGAGCCT 


TATTGCCCCA 


360 


CGATTCTTAG 


AAGTGGGCTC 


AGAAAGGCCG 


GTGACTTGCA 


CTTTGGATGG 


ACTGTTTCCT 


420 


GCCCCAGAAG 


CCGGGGTTTA 


CCTCTCTCTG 


GGAGATCAGA 


GGCTTCATCC 


TAATGTGACC 


480 


CTCGACGGGG 


AGAGCCTTGT 


GGCCACTGCC 


ACAG CTACAG 


CAAGTGAAGA 


ACAGGAAGGC 


540 


AC CAAACAGC 


TGATGTGCAT 


CGTGACCCTC 


GGGGGCGAAA 


GCAGGGAGAC 


CCAGGAAAAC 


600 


CTGACTGTCT 


ACAGCTTCCC 


GGCTCCTCTT 


CTGACTTTAA 


GTGAGCCAGA 


AGCCCCCGAG 


660 


GGAAAGATGG 


TGACCGTAAG 


CTGCTGGGCA 


GGGGCCCGAG 


CCCTTGTCAC 


CTTGGAGGGA 


720 


ATTCCAAGGA 


CCCTCTTACC 


GGCCCCATCT 


TTAACCTTAT 


CGTATCCCCT 


CTGCCTCATG 


780 


CCCGCAGACG 


CACCTCGGCT 


GGATGACTTG 


GACTGTCCCA 


GGAGCTGGAC 


GTGGC CAGAG 


840 


GGTCCAGAGC 


AGACCCTCCA 


CTGCGAGGCC 


CGTGGAAACC 


CTGAGCCCTC 


CGTGCACTGT 


900 


GCAAGGCCTG 


ACGGTGGGGC 


GGTGCTAGCG 


CTGGGCCTGT 


TGGGTCCAGT 


GACCCGTGCC 


960 


CTCGCGGGCA 


CTTACCGATG 


TACAG CAATC 


AATGGGCAAG 


GCCAGGCGGT 


CAAGGATGTG 


1020 


ACCCTGACTG 


TGGAATATGC 


CCCAGCGCTG 


GACAGTGTAG 


GCTGCCCAGA 


ACGTATTACT 


1080 


TGGCTGGAGG 


GGACAGAGGC 


ATCGCTTAGC 


TGTGTGGCAC ACGGGGTCCC 


ACCACCTAGC 


1140 


GTGAGCTGTG 


TGCGCTCTGG 


AAAGGAGGAA 


GTCATGGAAG 


GGCCCCTGCG 


TTTTGGCCGG 


1200 


GAGCACGCTG 


GCACTTACCG 


ATGCGAAGCC 


ATCAACGCCA 


GGGGATCAGC 


GGCCAAAAAT 


1260 


GTGGCTGTCA 


CGGTGGAATA 


TGGTCCCCGG 


AATTC 






1295 


(2) INFORMATION FOR SEQ ID NO: 7: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2214 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



CGAATCTTGA 


GTGGGATGCG 


GGACTCCCGT 


GCTATTTCTT 


GGCGGAGGTC 


TTTCCTGGTC 


60 


CTTATGGACA 


CCCCTGGTTT 


GGGATATGGG 


GGCCGCTAAG 


ATTTCAGAGA 


TGGGGTCCCT 


120 


AGGCTGAGCC 


CGCGTTTTCC 


CGGGCAGCGG 


TCGCGCTAGA ACCTTTCTGG 


GCGGACCTTC 


180 


AGCCCCGCGT 


GGCGCTCGTG 


GAGCGCGGGG 


GCTCGCTGTG 


GCTCAACTGC 


AG CACTAACT 


240 


GTCCGAGGCC 


GGAGCGCGGT 


GGYCTGGAGA 


CCTCGCTACG 


CCGAAACGGG 


ACCCAGAGGG 


300 


GTCTGCGCTG 


GCTGGCTCGA 


CAGMTGGTGG 


ACATCCGAGA 


GCCTGAAACC 


CAGTCGGTCT 


360 


GCTTCTTCCG 


CTGGGCGCGC 


CGCACACTCC 


AAGNGAGTGG 


GCTCATCCGA ACTTTCCAGC 


420 


GACCGGATCG 


GGTAGAGCTA 


GTGCCTCTGN 


CTCCTTGGCA 


GCCTGTAGGT 


GAGAACTTCA 


480 


CCTTGAGCTG 


CAGGGTCCCG 


GGGGCAGGAC 


CCCGAGCGAG 


CCTCACATTG 


ACCTTGCTGC 


540 


GAGGCGGCCA 


GGAGCTGATT 


CGCCGAAGTT 


TCGTAGGCGA 


GCCACCCCGA 


GCTCGGGGTG 


600 


CGATGCTCAC 


CGCCACGGTC 


CTGGCGCGCA 


GAGAGGATCA 


CAGGG C CAAT 


TTCTCATGCC 


660 


TCGCGGAGCT 


TGACCTGCGG 


ACACACGGCT 


TGGGACTGTT 


TGCAAACAGC 


TCAGCCCCCA 


720 


GACAGCTCCG 


CACGTTTGGC 


ATGCCTCCAC 


TTTCCCCGAG 


CCTTATTGNC 


CCACGATTCT 


780 


TAGAAGTGGG 


CT CAGAAAGG 


CCGGTGACTT 


GCACTTTGGA 


TGGACTGTTT 


CCTGCCCCAG 


840 


AAGCCGGGGT 


TTACCTCTCT 


CTGGGAGATC 


AGAGGCTTCA 


TCCTAATGTG 


ACCCTCGACG 


900 


GGGAGAGCCT 


TGTGGCCACT 


GNCACAGMTA 


CAGCAAGTGA 


AGAACAGGAA 


GGCACCAAAC 


960 


AGCTGATGTG 


CATCGTGACC 


CTCGGGGGCG 


AAAGCAGGGA GACCCAGGAA AACCTGACTG 


1020 


TCTACAGCTT 


CCCGGCTCCT 


CTTCTGACTT 


TAAGTGAGCC 


AGAAGCCCCC 


GAGG GAAAGA 


1080 


TGGTGACCGT AAGCTGCTGG 


GCAGGGGCCC 


GAGCCCTTGT 


CACCTTGGAG 


GGAATTCCAG 


1140 


CTGCGGTCCC 


TGGGCAGCCC 


GCTGAGCTCC 


AGTTAAATGT 


CACAAAGAAT 


GACGACAAGC 


1200 


GGGGCTTCTT 


CTGCGACGCT 


GCCCTCGATG 


TGGACGGGGA 


AACTCTGAGA 


AAGAACCAGA 


1260 


GCTCTGAGCT 


TCGTGTTCTG 


TACGCACCTC 


GGCTGGATGA 


CTTGGACTGT 


CCCAGGAGCT 


1320 


GGACGTGGCC 


AGAGGGTCCA GAGCAGACCC 


TCCACTGCGA GGCCCGTGGA AACCCTGAGC 


1380 


CCTCCGTGCA 


CTGTGCAAGG 


CCTGACGGTG 


GGGCGGTGCT 


AGCGCTGGGC 


CTGTTGGGTC 


1440 


CAGTGACCCG 


TGCCCTCGCG 


GGAACTTACC 


GATGTACAGC 


AATCAATGGG 


CAAGGCCAGG 


1500 


CGGTCAAGGA 


TGTGACCCTG 


ACTGTGGAAT 


ATGCCCCAGC 


GCTGGACAGT 


GTAGGCTGCC 


1560 


CAGAACGTAT 


TACTTGGCTG 


GAGGGGACAG 


AGGCATCGCT 


TAGCTGTGTG 


GCACACGGGG 


1620 


TCCCACCACC 


TAG CGTGAGC 


TGTGTGCGCT 


CTGGAAAGGA 


GGAAGTCATG 


GAAGGGCCCC 


1680 


TGCGTGTGGC 


CCGGGAGCAC 


GCTGGCACTT 


ACCGATGCGA 


AGCCATCAAC 


GNCAGGGGAT 


1740 


CAGCGGWCAA AAATGTGGCT 


GTCACGGTGG AATATGGTCC 


CAGTTTGGAG 


GAGTTGGGCT 


1800 
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GCCCCAGYAA CTGGACTTGG GTAGAAGGAT CTGGAAAACT GTTTTCCTGT GAAGTTGATG 1860 

GGAAGCCGGA ACCACGCGTG GAGTGCGTGG GCTCGGAGGG TGCAAGCGAA GGGGTAGTGT 1920 

TGCCCCTGGT GTCCTCGAAC TCTGGTTCCA GAAACTCTAT GACTCCTGGT AACCTGTCAC 1980 

CGGGTATTTA CCTCTGCAAC GCCACCAACC GGMATGGNTC CACAGTCAAA ACAGTCGTCG 2040 

TGAGCGCGGA ATCACCGCCA CAGATGGATG AATCCAGTTG CCCGAGTCAC CAGACATGGN 2100 

TGGAAGGAGC CGAGGNTACT GCGCTGGCCT GCAGTGCCAG AGGNCGCCCC TCTCCACGCG 2160 

TGCGCTGTTC CAGGGAAGGT GCAGMCAGGC TGGAGAGGNT ACAGGTGTCC CGAG 2214 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5077 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

CCGAACGCTC CTCGGCCTCT GGTCTNCTCT GGNCCTGGGG ATCCTAGGCA TCTCAGGTAA 60 

GAAGAGCCCG CCCGTGGAGC NAGGTGGATA AGGCGGGGGC GGAATTGAAG G AC CAGAG AG 120 

GGCGGCCCGG GTGTCCCCCT CCAGGCTCCG CCCTCTTCTA GCTTCCCACG CTTCTGTCAC 180 

CACCTGGAGN TCGGGGCTTC TCCCCGTCCT TCCTCCACCC CAACACACCT CAATCTTTCA 24 0 

GANCTGAACC CAGCACCTTT TCTGGANTNG GGGNNTTG CA CCTAACCTGT CTCAGGAGAN 300 

ACTGTGGCTC TCCTGTCCTC TCCTGCTCTG TNATGCCCTA TGGTTCACAG ACTGGCATCA 360 

TCCCTATTCA TGATCCTCAA AGACNCCATC TCCTCAACTG TCATAACTCA GAGCTCTATT 42 0 

CCCCCTCCAC CTGGAGCCCT GGAAACCGGC TTTCTAGGGC TTTTCTCCGC GGTTCTTTCC 480 

CGGAGTTCAG CGTTGTGGCT TTTTGTCCAA GTTACTCAAG TTTGGGGACA ATCTCCTTTA 540 

AGC CTTTGAC TCAGTCTCAT TTCCACTTTG CTTTTGCCCC AAGCCTCTGT GTCTCTCCCC 600 

CATTTCCTGA CGATCTGTCA GAGTCTTAAG AGTGATTTGG TTCCCCATCC CCCCTCCAAC 660 

TGGAGTCTCC TCCTCACTAT TGATGTGTGC ATCTGAGACC CCCATCCCCG CACCGAGTTT 720 

CCCCATCTCT GTCAGTAAAG AGCAAGGCTT C CAGAG ACAA CCCTCTAATA GCGCGTCAGT 780 

CCCGAATCTT GAGTGGGATG CGGGACTCCC GTGCTATTTC TTGGCGGAGG TCTTTCCTGG 840 

TCCTTATGGA CACCCCTGGT TTGGGATATG GGGGCCGCTA AGATTTCAGA GATGGGGTCC 900 

CTAGGCTGAG NCCGCGTTTT CCCGGGCAGC GGTCGCGCTA GAACCTTTCT GGGCGGACCT 960 

TCAGCCCCGC GTGGCGCTCG TGGAGCGCGG GGGCTCGCTG TGGCTCAACT GCAGCACTAA 1020 

CTGTCCGAGG CCGGAGCGCG GTGGCCTGGA GACCTCGCTA CGCCGAAACG GGACCCAGAG 1080 

GGGTCTGNAC TGNCTGGCTC GACAGCTGGT GGACATC CGA GANCCTGAAA CCCAGCCGGT 114 0 
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CTGCTTCTTC 


CNCTGCGCGC 


GCCGCACACT 


CCAAGCGCGT 


GGGCTCATCC GAACTTTCCG 


1200 


TGAGTTCAGG 


GTGGGCACNC 


CCCTTGGGTC 


TCTGGACCTC 


CCCCTCAAGC TCCTCCCACC 


1260 


CGCCCTCTGA 


TCCTCCTGCT 


TGTTCTGAAA 


GTACTACAGC 


TGGCTAGAGC GGAGTTTTTG 


1320 


GTCCCTTGCA 


GAGCGACCGG 


ATCGGGTAGA 


GCTAGTGCCT 


CTGCCTCCTT GGCAGCCTGT 


1380 


AGGTGAGAAC 


TTCACCTTGA 


GCTGCAGGGT 


CCCGGGGGCA 


GGACCCCGAG CGAGCCTCAC 


1440 


ATTGAC CTTG 


CTGCGAGGCG 


GCCAGGAGCT 


GATTCGCCGA AGTTTCGTAG GCGAGCCACC 


1500 


CCGAGCTCGG 


GGTGCGATGC 


TCACCGCCAC 


GGTCCTGGCG 


CGCAGAGAGG ATCACAGGGC 


1560 


CAATTTCTCA 


TGCCTCGCGG 


AGCTTGACCT 


GCGNCCACAC 


GGCTTGGGAC TGTTTGCANA 


1620 


CAGCTCAGCC 


CCCAGACAGC 


TCCGCACGTT 


TGGTGAGTGT 


GGACCCTAAC TGACAGATTT 


1680 


TAAGAAGTTT 


AGGGCAGCCA 


GGCGTGGTGG 


CATGGTGTCG 


TAGG CCCTAA GTCCCAGCCC 


1740 


AAGCAGANCT 


AAGNCGGATC 


TCTTGTGAAT 


TAAAAGTCTA 


GCTCGTCTAC ATAACGAGGN 


1800 


CTG CATAGTT 


AAATCCCCCA AAAGTCTAAG 


CAGCTAGCCC 


TTACTTCCAA CACAAGTACT 


1860 


AG CTTAAGTA 


CTTTCTCCTG 


TG AG CTTTTT 


CCTTTATGTA 


TTTACTCGTT GAGAGAAAAA 


1920 


GAGAGTGTGT 


GTACGTGCCT 


TTATG CACAT 


GCCGCAGTGC 


TTGTATGGAA GTTAAAGAAT 


1980 


AAGGAGGCGT 


TCTGCCCTTC 


CATCCTGTGG 


GTCCTAGGGG 


TGGTATTAGC TCCTCAGGCT 


2040 


TTGTTAGTNA 


CAAGCGCCTA 


GGCTTGGGGA 


GCCATCTCGC 


CCGCTCCTCT GTATCTTTAG 


2100 


GGTGAAACCA 


GACAATG CAT 


GCAAATTGGT 


TGATCAACAC 


TGAATGTTTA GTTCGTAAAT 


2160 


TCAAGCTCTG 


TTCTTTGTCT 


TCCTCAGCCA 


TGCCTCCACT 


TTCCCCCGAG CCTTATTGCC 


2220 


CCACGATTCT 


TAGAAGTGGG 


CTCAGAAAGG 


CCGGTGACKT 


GCACTTTGGA TGGACTGTTT 


2280 


CCTGCCCCAG 


AAGCCGGGGT 


TTACTTCTCT 


CTGGGAGATC 


AGAGGCTTCA TCCTAATGTG 


2340 


ACCCTCGACG 


GGGAGAGCCT 


TGTGGCCACT 


GCCACAGCTA 


CAGCAAGTGA AGAACAGGAA 


2400 


GGCACCAAAC 


AGCTGATGTG 


CATCGTGACC 


CTCGGGGGCG AAAGCAGGGA GACCCAGGAA 


2460 


AAC CTGACTG 


TCTACAGTAA 


GGGGAATCCA 


ACAAGACCTT 


CAATAGCTCA GACTGGGGCT 


2520 


GGGGCTGGGT 


CTGGGTCTGG 


GGCCAGAGTC 


TCACAAAGGC 


GGAGCCTATA AAGTGGGCGG 


2580 


GACCTCCACA 


CCAGAACAAG 


CCGGGCGGGA 


GAGTTCCAGG 


GCAGGAGCAG ATAGAAGTTG 


2640 


GAAATTAATA 


GATTGGGTTG 


AGTTCCCTGA 


GTGGGGAGTG 


AACCCCACCC AATTCTCTGT 


2700 


CCCCAGGCTT 


CCCGGCTCCT 


CTTCTGACTT 


TAAGTGAGCC 


AGAAGCCCCC GAGGGAAAGA 


2760 


TGGTGACCGT AAGCTGCTGG 


GCAGGGGCCC 


GAGCCCTTGT 


CACCTTGGAG GGAATTCCAG 


2820 


CTGCGGTCCC 


TGGGCAGCCC 


GCTGAGCTCC 


AGTTAAATGT 


CACAAAGAAT GACGACAAGC 


2880 


GGGGCTTCTT 


CTGCGACGCT 


GCCCTCGATG 


TGGACGGGGA AACTCTGAGA AAGAACCAGA 


2940 


GCTCTGAGCT 


TCGTGTTCTG 


TGTGAGTGGA 


TGTTCACTTT 


ATCTCTGTGA ATTCCAAGGA 


3000 


CCCTCTTACC < 


GGCCCCATCT ' 


TTAAC CTTAT 


CGTATCCCCT 


CTGCCTCATG CCCGCAGACG 


3060 
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CACCTCGGCT 


GGATGACTTG 


GACTGTCCCA 


GGAGCTGGAC 


GTGGCCAGAG 


GGTC CAGAGC 


3120 


AGACCCTCCA 


CTGCGAGGCC 


CGTGGAAACC 


CTGAGCCCTC 


CGTGCACTGT 


GCAAGGCCTG 


3180 


ACGGTGGGGC 


GGTGCTAGCG 


CTGGGCCTGT 


TGGGTCCAGT 


GACCCGTGCC 


CTCGCGGGCA 


3240 


CTTACCGATG 


TACAG CAATC 


AATGGGCAAG 


GCCAGGCGGT 


CAAGGATGTG 


ACCCTGACTG 


3300 


TGGAATGTGA 


GTAGGGGGAG 


GTGGGCATGC 


TTATCCCTTT 


AAGGTCACGG 


AGTGTACTGG 


3360 


GAGACTGGCT 


ATACGGAAAG 


GAAAGAAGCC 


TAGGTTCAGC 


AGGGATTGGG 


AAAACACTGA 


3420 


AGGAAAGTGG 


TGTGGTGTTT 


ACAAACTTAA 


CGGTGGTAAC 


TGGGCACGGT 


CTGGCAAAAA 


3480 


CAGACAGCCA 


AGAGAGTGTG 


CCTGGGAAGC 


TGCAATGGGG 


GCTTTGTGGG 


AATTGGTCAA 


3540 


CAGCACCCTG 


AGATCTCAGG 


AAAGGGGCCT 


GAAGTTATCT 


CCAGAACCCA 


TGTGAAGGCA 


3600 


GGAAGAGAGA 


ACGCCCACCT 


TTTCCTGCTC 


CCCCCAACCC 


CCCCCCACAT 


ATCACACGGA 


3660 


GTATATAAAT 


AAATAAAATG 


GCTCCTGCCG 


GAGGGAGTGA 


GAAGCTGTCT 


CCTGCAGGCT 


3720 


CAGAGCAGTG 


GTAGTGCATG 


CCTTTAATCC 


CAGCACTCGG 


TAGGCAAAGG 


CAGGCAGATC 


3780 


TCTGTGAATG 


TGGGGCCAGC 


CTGGTCTGTA 


CAGAGAAATC 


CTGTCTCAAA 


ACAAACCAGC 


3840 


AAAGAAACAA 


AACCAAAATC 


AATTC CAG AT 


GCCCCAGCGC 


TGGACAGTGT 


AGGCTGCCCA 


3900 


NGACGTATTA 


CTTGNCTGGA 


GGGGACAGAG 


GCATCGCTTA 


GCTGTGTGGC 


ACACGGGGTC 


3960 


CCACCACCTA 


GCGTGAGCTG 


TGTGCGCTCT 


GGAAAGGAGG 


AAGTCATGGA 


AGGGCCCCTG 


4020 


CGTGTGGCCC 


GGGAGCACGC 


TGGCACTTAC 


CGATGCGAAG 


CCATCAACGC 


CAGGGGATCA 


4080 


GCGGNCAAAA 


ATGTGGCTGT 


CACGGTGGAA 


TGTGAGTAGG 


GGTGGCTACG 


GAAATGTCCA 


4140 


CACCTGCGTC 


CTCTGTCCTC 


AGTGTGAACT 


CCTATTTCCC 


TGCTTCCTAG 


ATGGTCCCAG 


4200 


TTNTGAGGAG 


TTGGGCTGCC 


CCAGCAACTG 


GACTTGGGTA 


GAAGGATCTG 


GAAAACTGTT 


4260 


TTCCTGTGAA 


GTTGATGGGA AGCCGGAACC 


ACGCGTGGAG 


TGCGTGGGCT 


CGGAGGGTGC 


4320 


AAGCGAAGGG 


GTAGTGTTGC 


CCCTGGTGTC 


CTCGAACTCT 


GGTTCCAGAA 


ACTCTATGAC 


4380 


TCCTGGTAAC 


CTGTCACCGG 


GTATTTACCT 


CTGCAACGCC 


ACCAACCGGC 


ATGGCTCCAC 


4440 


AGTCAAAACA 


GTCGTCGTGA GCGCGGAATG TGAGCAGGGG 


CCCAGGTGGG 


CGGAGAGTAC 


4500 


CGGGTGTCCC 


AGGATCTTTT 


CTTTCCCTGA 


TGCCCCTCCT 


TATGGTGGCT 


GATCTGCAGC 


4560 


ACCGCCACAG 


ATGGATGAAT 


CCAGTTGCCC 


GAGTCACCAG 


ACATGGCTGG 


AAGGAGCCGA 


4620 


GGCTACTGCG 


CTGGCCTGCA 


GTGACAGGGG 


NCGCCCCTCT 


CCACGCGTGC 


GCTGTTCCAG 


4680 


GGAAGGTGCA 


GCCAGGCTGG 


AGAGGCTACA 


GGTGTCCCGA 


GAGGATGCGG 


GGACCTACCT 


4740 


GTGTGTGGCT 


ACCAACGCGC 


ATGGCACGGA 


TTCACGGACC 


GTCACTGTGG 


GTGTGGAATG 


4800 


TGAGTGAGGA 


CAGCGCTGAA 


TGAAGACGAC 


TCAGACCGCC 


AGAAAAGTGC 


CTTGAGGCCT 


4860 


GGGATGTATG 


ATC CAGTGGG 


TAGAGTGCTC 


AATTAGCACT 


CACTAAAATG 


TATATTCTAT 


4920 


TCCTAATACT 


CTTTAATTTT 


ANCCTTTGGG 


AGGCAGAGAC 


AGGCAGATCT 


CTGTTCCGGG 


4980 
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ATAACCTGCT CTCTGTCTAG GACAGCTTGG TCTACAGAGG GGNTACAGGC CCCCCCTCCC 5040 
AAGATTGNAT AGCAACCCTC TGGCTCCCTG TCTCTCT 5077 
(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 



NGAATTCCGG 


CGGATCGGGT 


AG AG CTAGTG 


CCTCTGCCTC 


CTTGGCAGCC 


TGTAGGTGAG 


60 


AACTTCACCT 


TGAGCTGCAG 


GGTCCCGGGG 


GCAGGACCCC 


GAGCGAGCCT 


CACATTGACC 


120 


TTGCTGCGAG 


GCGGCCAGGA 


GCTGATTCGC 


CGAAGTTTCG 


TAGGCGAGCC 


ACCCCGAGCT 


180 


CGGGGTGCGA 


TGCTCACCGC 


CACGGTCCTG 


GCGCGCAGAG 


AGGATCACAG 


GGCCAATTTC 


240 


TCATGCCTCG 


CGGAGCTTGA 


CCTGCGGCCA 


CACGGCTTGG 


GACTGTTTGC AAACAGCTCA 


300 


GCCCCCAGAC 


AGCTCCGCAC 


GTTTGCCATG 


CCTCCACTTT 


CCCCGAGCCT 


TATTGCCCCA 


360 


CGATTCTTAG 


AAGTGGGCTC 


AGAAAGGCCG 


GTGACTTGCA 


CTTTGGATGG 


ACTGTTTCCT 


420 


GCCCCAGAAG 


CCGGGGTTTA 


CCTCTCTCTG 


GGAGATCAGA 


GGCTTCATCC 


TAATGTGACC 


480 


CTCGACGGGG 


AGAGCCTTGT 


GGCCACTGCC 


ACAG CTACAG 


CAAGTGAAGA 


ACAGGAAGGC 


540 


ACCAAACAGC 


TGATGTGCAT 


CGTGACCCTC 


GGGGGCGAAA 


GCAGGGAGAC 


CCAGGAAAAC 


600 


CTGACTGTCT 


ACAGCTTCCC 


GGCTCCTCTT 


CTGACTTTAA GTGAGCCAGA AGCCCCCGAG 


660 


GGAAAGATGG 


TGACCGTAAG 


CTGCTGGGCA 


GGGGCCCGAG 


CCCTTGTCAC 


CTTGGAGGGA 


720 


ATTCCAGCTG 


CGGTCCCTGG 


GCAGCCCGCT 


GAGCTCCAGT 


TAAATGTCAC 


AAAGAATGAC 


780 


GACAAGCGGG 


GCTTCTTCTG 


CGACGCTGCC 


CTCGATGTGG 


ACGGGGAAAC 


TCTGAGAAAG 


840 


AACCAGAGCT 


CTGAGCTTCG 


TGTTCTGTGT 


GAGTGGATGT 


TCACTTTATC 


TCTGTGAATT 


900 


CCAAGGACCC 


TCTTACCGGC 


CCCATCTTTA 


ACCTTATCGT 


ATCCCCTCTG 


CCTCATGCCC 


960 


GCAGACGCAC 


CTCGGCTGGA 


TGACTTGGAC 


TGTCCCAGGA GCTGGACGTG 


GCCAGAGGGT 


1020 


CCAGAGCAGA 


CCCTCCACTG 


CGAGGCCCGT 


GGAAACCCTG 


AGCCCTCCGT 


GCACTGTGCA 


1080 


AGGCCTGACG 


GTGGGGCGGT 


GCTAGCGCTG 


GGCCTGTTGG 


GTCCAGTGAC 


CCGTGCCCTC 


1140 


GCGGGCACTT 


ACCGATGTAC 


AGCAATCAAT 


GGGCAAGGCC 


AGGCGGTCAA 


GGATGTGACC 


1200 


CTGACTGTGG 


AATATGCCCC 


AGCGCTGGAC 


AGTGTAGGCT 


GCCCAGAACG 


TATTACTTGG 


1260 


CTGGAGGGGA 


CAGAGGCATC 


GCTTAGCTGT 


GTGGCACACG 


GGGTCCCACC 


ACCTAGCGTG 


1320 


AGCTGTGTGC 


GCTCTGGAAA 


GGAGGAAGTC 


ATGGAAGGGC 


CCCTGCGTTT 


TGGCCGGGAG 


1380 


CACGCTGGCA 


CTTAC CGATG 


CGAAGCCATC 


AACGCCAGGG 


GATCAGCGGC 


CAAAAATGTG 


1440 
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GCTGTCACGG TGGAATATGG TCCCCGGAAT TC 1472 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2550 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

CCTCTGCCTC CTTGGCAGCC TGTAGGTGAG AACTTCACCT TGAGCTGCAG GGTCCCGGGG 60 

GCAGGACCCC GAGCGAGCCT CACATTG AC C TTGCTGCGAG GCGGCCAGGA GCTGATTCGC 120 

CGAAGTTTCG TAGGCGAGCC ACCCCGAGCT CGGGGTGCGA TGCTCACCGC CACGGTCCTG 180 

GCGCGCAGAG AGGATCACAG GGCCAATTTC TCATGCCTCG CGGAGCTTGA CCTGCGGCCA 240 

CACGGCTTGG GACTGTTTGC AAACAGCTCA GCCCCCAGAC AGCTCCGCAC GTTTGCCATG 300 

CCTCCACTTT CCCCGAGCCT TATTGCCCCA CGATTCTTAG AAGTGGGCTC AGAAAGGCCG 3 60 

GTGACTTGCA CTTTGGATGG ACTGTTTCCT GCCCCAGAAG CCGGGGTTTA CCTCTCTCTG 420 

GGAGATCAGA GGCTTCATCC TAATGTGACC CTCGACGGGG AGAGCCTTGT GGCCACTGCC 4 80 

ACAGCTACAG CAAGTGAAGA ACAGGAAGGC ACCAAACAGC TGATGTGCAT CGTGACCCTC 540 

GGGGGCGAAA GCAGGGAGAC CCAGGAAAAC CTGACTGTCT ACAGCTTCCC GGCTCCTCTT 600 

CTGACTTTAA GTGAGCCAGA AGCCCCCGAG GGAAAGATGG TGACCGTAAG CTGCTGGGCA 660 

GGGGCCCGAG CCCTTGTCAC CTTGGAGGGA ATTCCAGCTG CGGTCCCTGG GCAGCCCGCT 720 

GAGCTCCAGT TAAATGTCAC AAAGAATGAC GACAAG CGGG GCTTCTTCTG CGACGCTGCC 780 

CTCGATGTGG ACGGGGAAAC TCTGAGAAAG AACCAGAGCT CTGAGCTTCG TGTTCTGTAC 840 

GCACCTCGGC TGGATGACTT GGACTGTCCC AGGAGCTGGA CGTGGCCAGA GGGTCCAGAG 900 

CAGACCCTCC ACTGCGAGGC CCGTGGAAAC CCTGAGCCCT CCGTGCACTG TGCAAGGCCT 9 60 

GACGGTGGGG CGGTGCTAGC GCTGGGCCTG TTGGGTCCAG TGACCCGTGC CCTCGCGGGC 1020 

ACTTACCGAT GTACAGCAAT CAATGGGCAA GGCCAGGCGG TCAAGGATGT GACCCTGACT 1080 

GTGGAATATG CCCCAGCGCT GGACAGTGTA GGCTGCCCAG AACGTATTAC TTGGCTGGAG 1140 

GGGACAGAGG CATCGCTTAG CTGTGTGGCA CACGGGGTCC CACCACCTAG CGTGAGCTGT 1200 

GTGCGCTCTG GAAAGGAGGA AGTCATGGAA GGGCCCCTGC GTGTGGCCCG GGAGCACGCT 1260 

GGCACTTACC GATGCGAAGC CATCAACGCC AGGGGATCAG CGGCCAAAAA TGTGGCTGTC 1320 

ACGGTGGAAT ATGGTCCCAG TTTTGAGGAG TTGGGCTGCC CCAGCAACTG GACTTGGGTA 1380 

GAAGGATCTG GAAAACTGTT TTCCTGTGAA GTTGATGGGA AGCCGGAACC ACGCGTGGAG 1440 

TGCGTGGGCT CGGAGGGTGC AAGCGAAGGG GTAGTGTTGC CCCTGGTGTC CTCGAACTCT 1500 
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GGTTCCAGAA ACTCTATGAC TCCTGGTAAC CTGTCACCGG 
ACCAACCGGC ATGGCTCCAC AGTCAAAACA GTCGTCGTGA 
ATGGATGAAT CCAGTTGCCC GAGTCACCAG ACATGGCTGG 
CTGGCCTGCA GTGCCAGAGG CCGCCCCTCT CCACGCGTGC 
GCCAGGCTGG AGAGGCTACA GGTGTCCCGA GAGGATGCGG 
ACCAACGCGC ATGGCACGGA TTCACGGACC GTCACTGTGG 
GTGGCTGAGC TGGCAGCCTC GCCCCCAAGC GTGCGGCCTG 
TGCCGTGCAG AGGCCTGGCC TCCAGCCCAG ATCAGCTGGC 
AACCTCGGTC TCTCCAGCAA CAACAGCACG CTGAGCGTGG 
GGTGGCGAGT ATGAGTGCGC AGCCACCAAT GCGCATGGGC 
GTGCGCGTGG CCGGTCCATG GCTGTGGGTC GCTGTGGGCG 
CTGCTGGCCG CAGGGGCCGG CCTGGCCTTC TACGTGCAGT 
GAGTACAACG TCCAGGAGGC TGAGAGCTCA GGCGAGGCGG 
GGGACACCGG GTGCAGAAGG CGGAGCAGAG ACCCCCGGCA 
GGCGAGGTTT TCGCCATCCA GCTGACATCT TCCTGAGCCT 
C CT CGAAAG C ACAGGGGTGG ACGTATGTAT TGTTCACTCT 
GCGTCGTCCC CGTTTTCTAC CCATTCCCTT AATAAAGTTT 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
AATTCGATCA CTCGCGCTCC CCTCGCCTTC TGCGCTCTCC 
ATGCCGGGGC CTTCAC CAGG GCTGCGCCGA ACGCTCCTCG 
CTGGGGATCC TAGGCATCTC AGCGGTCGCG C TAGAAC CTT 
CGCGTGGCGC TCGTGGAGCG CGGGGGCTCG CTGTGGCTCA 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 292 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



GTATTTACCT 
GCGCGGAATC 
AAGGAGCCGA 
GCTGTTCCAG 
GGACCTACCT 
GTGTGGAATA 
GCGGAAACTT 
GCGCGCCCCC 
CGGGTGCCAT 
GCCACGCACG 
GTGCGGCAGG 
CCACCGCTTG 
TGTGTCTCAA 
CTGCCGAGTC 
GTATCCAGCT 
CTATTTATTC 
TTATAGGAGA 



CTGCAACGCC 
ACCGCCACAG 
GGCTACTGCG 
GGAAGGTGCA 
GTGTGTGGCT 
CCGGCCTGTG 
CACTCTGACC 
GGGAGCTCTC 
GGGCAGCCAT 
GCGCATCACG 
GGGCGCGGCG 
CAAGAAGGGA 
TGGCGCGGGC 
ACCTGCAGAT 
CCCCCAGGGG 
AACTCCAGGG 
AAAAAAAAAA 



CCTCCCTGGC AGCGGCGGCA 
GCCTCTGGGC TGCCCTGGGC 
TCTGGGCGGA CCTTCAGCCC 
AC 



1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2550 



120 
180 
222 
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(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 
TGTGGAGCTG GCACCCCTGC CTCCTTGGCA GCCGGTGGGC CAGAACTTCA CCCTGCGCTG GO 
CCAAGTGGAG GGTGGGTCGC CCCGGACCAG CCTCACGGTG GTGCTGCTTC GCTGGGAGGA 120 
GGAGCTGAGC CGGCAGCCCG CAGTGGAGGA GCCAGCGGAG GTCACTGCCA CTGTGCTGGC 180 
CAGCAGAGAC GACCACGGAG CCCCTTTCTC ATGCCGCACA GAACTGGACA TGCAGCCCCA 240 
GGGGCTGGGA CTGTTCGTGA ACACCTCAGC CCCCCGCCAG CTCCGAACCT TT 292 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

Pro Asp Arg Val Glu Leu Val Pro Leu Pro Pro Trp Gin Pro Val Gly 
15 10 15 

Glu Asn Phe Thr Leu Ser Cys Arg Val Pro Gly Ala Gly Pro Arg Ala 
20 25 30 

Ser Leu Thr Leu Thr Leu Leu Arg Gly Gly Gin Glu Leu lie Arg Arg 
35 40 45 

Ser Phe Val Gly Glu Pro Pro Arg Ala Arg Cys Ala Met Leu Thr Ala 
50 55 60 

Thr Val Leu Ala Arg Arg Glu Asp His Arg Asp Asn Phe Ser Cys Leu 
65 70 75 80 

Ala Glu Leu Asp Leu Arg Thr His Gly Leu Gly Leu Phe Ala Asn Ser 
85 90 95 

Ser Ala Pro Arg Gin Leu Arg Thr Phe 
100 105 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GAACTCGAGG CCATGCCTCC ACTTTCC 27 
(2) INFORMATION FOR SEQ ID NO -.15: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
CCATAAGCTT TATTCCACCG TGACAGCCAC 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
AACGTGCGGA GCTGTCTG 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ACGGAATTCG AAGCCATCAA CGCCAGG 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CATGAATTCC GAATCTTGAG TGGGATG 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 
ATAGAATTCC TCGGGACACC TGTAGCC 
(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CARGGTGACA AGGGCTCG 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
TATGAATTCA GTTGAGCCAC AGCGAGC 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CCGGGTCCTA GAGGTGGACA CGCA 
(2) INFORMATION FOR SEQ ID NO: 23: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
TGCAGTGTCT CCTGGCTCTG GTTC 
(2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 992 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

GCGAAAACCG GGAGACCCGG GAGAACGTGA CCATCTACAG CTTCCCGGCA CCACTCCTGA 60 

CCCTGAGCGA ACCCAGCGTC TCCGAGGGGC AGATGGTGAC AGTAACCTGC GCAGCTGGGG 120 

CCCAAGCTCT GGTCACACTG GAGGGAGTTC CAGCCGCGGT CCCGGGGCAG CCCGCCCAGC 180 

TTCAGCTAAA TGCCACCGAG AACGACGACA GACGCAGCTT CTTCTGCGAC GCCACCCTCG 240 

ATGTGGACGG GGAGACCCTG ATCAAGAACA GGAGCGCAGA GCTTCGTGTC CTATACGCTC 300 

CCCGGCTAGA CGATTCGGAC TGCCCCAGGA GTTGGACGTG GCCCGAGGGC CCAGAGCAGA 360 

CGCTGCGCTG CGAGGCCCGC GGGAACCCAG AACCCTCAGT GCACTGTGCG CGCTCCGACG 42 0 

GCGGGGCCGT GCTGGCTCTG GGCCTGCTGG GTCCAGTCAC TCGGGCGCTC TCAGGCACTT 4 80 

ACCGCTGCAA GGCGGCCAAT GATCAAGGCG AGGCGGTCAA GGACGTAACG CTAACGGTGG 540 

AGTACG CACC AGCGCTGGAC AGCGTGGGCT GCCCAGAACG CATTACTTGG CTGGAGGGAA 600 

CAGAAGCCTC GCTGAGCTGT GTGGCGCACG GGGTACCGCC GCCTGATGTG ATCTGCGTGC 660 

GCTCTGGAGA ACTCGGGGCC GTCATCGAGG GGCTGTTGCG TGTGGCCCGG GAGCATGCGG 720 

GCACTTACCG CTGCGAAGCC ACCAACCCTC GGGGCTCTGC GGCCAAAAAT GTGGCCGTCA 780 

CGGTG GAATA TGGCCCCAGG TTTGAGGAGC CGAGCTGCCC CAGCAATTGG ACATGGGTGG 840 

AAGGATCTGG GCGCCTGTTT TCCTGTGAGG TCGATGGGAA GCCACAGCCA AGCGTGAAGT 900 

GCGTGGGCTC CGGGGGCACC ACTGAGGGGG TGCTGCTGCC GCTGGCACCC CCAGACCCTA 960 

GTCCCAGAGC TCCCAGAATC CCTAGAGTCC TG 992 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

GCAGCCTCGC GTGGCGTTCG TGGAGCGCGG GGGCTCGCTG TGGCTGAATT GCAGCACCAA 60 

CTGCCCTCGG CCGGAGCGCG GTGGCCTGGA GACCTCGCTG CGCCGAAACG GGACCCAGAG 120 

GGGTTTGCGT TGGTTGGCGC GGCAGCTGGT GGACATTCGC GAGCCGGAGA CTCAGCCCGT 180 

CTGCTTCTTC CGCTGCGCGC GGCGCACACT ACAGGCGCGT GGGCTCATTC GCACTTTCCA 24 0 



- 81 - 



GCGACCAGAT 


CGCGTAGAGC TGATGCCGCT GCCTCCCTGG 


CAGCCGGTGG 


GCGAGAACTT 


300 


CACCCTGAGC 


TGTAGGGTCC CCGGCGCCGG GCCCCGTGCG 


AGCCTCACGC 


TGACCCTGCT 


360 


GCGGGGCGCC 


CAGGAGCTGA TCCGCCGCAG CTTCGCCGGT 


GAACCACCCC 


GAGCGCGGGG 


420 


CGCGGTGCTC 


ACAGCCACGG TACTGGCTCG GAGGGAGGAC 


CATGGAGCCA ATTTCTCGTG 


480 


TCGCGCCGAG 


CTGGACCTGC GGCCGCACGG ACTGGGACTG 


TTTGAAAACA 


GCTCGGCCCC 


540 


CAGAGAGCTC 


CGAACCTTCT CCCTGTCTCC GGATGCCCCG 


CGCCTCGCTG 


CTCCCCGGCT 


600 


CTTGGAAGTT 


GGCTCGGAAA GGCCCGTGAG CTGCACTCTG 


GACGGACTGT 


TTCCAGCCTC 


660 


AGAGGCCAGG 


GTCTACCTCG CACTGGGGGA CCAGAATCTG 


AGTCCTGATG 


TCACCCTCGA 


720 


AGGGGACGCA TTCGTGGCCA CTGCCACAGC CACAGCTAGC 


GCAGAGCAGG 


AGGGTGCCAG 


780 


GCAGCTGGTC 


TGCAACGTCA CCCTGGGGGG CGAAAACCGG 


GAGACCCGGG 


AGAACGTGAC 


840 


CATCTACAGC 


TTCCCGGCAC CACTCCTGAC CCTGAGCGAA 


CCCAGCGTCT 


CCGAGGGGCA 


900 


GATGG TGACA 


GTAACCTGCG CAG CTGGGGC CCAAGCTCTG 


GTCACACTGG 


AGGGAGTTCC 


960 


AGCCGCGGTC 


CCGGGGCAGC CCGCCCAGCT TCAGCTAAAT 


GCCACCGAGA ACGACGACAG 


1020 


ACGCAGCTTC 


TTCTGCGACG CCACCCTCGA TGTGGACGGG 


GAGACCCTGA 


TCAAGAACAG 


1080 


GAGCGCAGAG 


CTTCGTGTCC TATACGCTCC CCGGCTAGAC 


GATTCGGACT 


GCCCCAGGAG 


1140 


TTGGACGTGG 


CCCGAGGGCC CAGAGCAGAC GCTGCGCTGC 


GAGGCCCGCG 


GGAACCCAGA 


1200 


ACCCTCAGTG 


CACTGTGCGC GCTCCGACGG CGGGGCCGTG 


CTGGCTCTGG 


GCCTGCTGGG 


1260 


TCCAGTCACT 


CGGGCGCTCT CAGGCACTTA CCGCTGCAAG 


GCGGCCAATG 


ATCAAGGCGA 


1320 


GGCGGTCAAG 


GACGTAACGC TAACGGTGGA GTACGCACCA GCGCTGGACA GCGTGGGCTG 


1380 


CCCAGAACGC 


ATTACTTGGC TG GAG GGAAC AGAAGCCTCG 


CTGAGCTGTG 


TGGCGCACGG 


1440 


GGTACCGCCG 


CCTGATGTGA TCTGCGTGCG CTCTGGAGAA 


CTCGGGGCCG 


TCATCGAGGG 


1500 


GCTGTTGCGT 


GTGGCCCGGG AGCATGCGGG CACTTACCGC 


TGCGAAGCCA 


CCAACCCTCG 


1560 


GGGCTCTGCG 


GCCAAAAATG TGGCCGTCAC GGTGGAATAT 


GGCCCCAGGT 


TTGAGGAGCC 


1620 


GAGCTGCCCC 


AGCAATTGGA CATGGGTGGA AGGATCTGGG 


CGCCTGTTTT 


CCTGTGAGGT 


1680 


CGATGGGAAG 


CCACAGCCAA GCGTGAAGTG CGTGGGCTCC 


GGGGGCACCA 


CTGAGGGGGT 


1740 


GCTGCTGCCG 


CTGGCACCCC CAGACCCTAG TCCCAGAGCT 


CCCAGAATCC 


CTAGAGTCCT 


1800 


GGCACCCGGT 


ATCTACGTCT GCAACGCCAC CAACCGCCAC 


GGCTCCGTGG 


CCAAAACAGT 


1860 


CGTCGTGAGC 


GCGGAGTCGC CACCGGAGAT GGATGAATCT ACCTGCCCAA GTCACCAGAC 


1920 


GTGGCTGGAA 


GGGGCTGAGG CTTCCGCGCT GGCCTGCGCC 


GCCCGGGGTC 


GCCCTTCCCC 


1980 


AGGAGTGCGC 


TGCTCTCGGG AAGGCATCCC ATGGCCTGAG 


CAGCAGCGCG 


TGTCCCGAGA 


2040 


GGACGCGGGC 


ACTTAC CACT GTGTGGCCAC CAATGCGCAT 


GGCACGGACT 


CCCGGACCGT 


2100 


CACTGTGGGC 


GTGGAATACC GGCCAGTGGT GGC CGAACTT GCTGCCTCGC 


CCCCTGGAGG 


2160 
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CGTGCGCCCA GGAGGAAACT TCACGTTGAC CTGCCGCGCG GAGGCCTGGC CTCCAGCCCA 2220 

GATCAGCTGG CGCGCGCCCC CGAGGGCCCT CAACATCGGC CTGTCGAGCA ACAACAGCAC 2280 

ACTGAGCGTG GCAGGCGCCA TGGGAAGCCA CGGCGGCGAG TACGAGTGCG CACGCACCAA 234 0 

CGCGCACGGG CGCCACGCGC GGCGCATCAC GGTGCGCGTG GCCGGTCCGT GGCTATGGGT 2400 

CGCCGTGGGC GGCGCGGCGG GGGGCGCGGC GCTGCTGGCC GCGGGGGCCG GCCTGGCCTT 2460 

CTACGTGCAG TCCACCGCCT GCAAGAAGGG CGAGTACAAC GTGCAGGAGG CCGAGAGCTC 2520 

AGGCGAGGCC GTGTGTCTGA ACGGAGCGGG CGGCGGCGCT GGCGGGGCGG CAGGCGCGGA 2580 

GGGCGGACCC GAGGCGGCGG GGGGCGCGGC CGAGTCGCCG GCGGAGGGCG AGGTCTTCGC 2640 

CATACAGCTG ACATCGGCGT GAGCCCGCTC CCCTCTCCGC GGGCCGGGAC GCCCCCCAGA 2700 

CTCACACGGG GGCTTATTTA TTG CTTTATT TATTTACTTA TTCATTTATT TATGTATTCA 2760 

ACTCCAAGGG AATTC 2775 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

CGCGCTCTCC TCGCCTCCTG TGCTTTCCCC GCCGCGGCGA TGCCAGGGCC TTCGCCAGGG 60 

CTGCGCCGGG CGCTACTCGG CCTCTGGGCT GCTCTGGGCC TGGGGCTCTT CGGCCTCTCA 120 

GCGGTCTCGC AGGAGC CCTT CTGGGCGGAC CTGCAGCCTC GCGTGGCGTT CGTGGAGCGC 180 

GGGGGCTCGC TGTGGCTGAA TTGCAGCACC AACTGCCCTC GGCCGGAGCG CGGTGGCCTG 240 

GAGACCTCGC TGCGCCGAAA CGGGACCCAG AGGGGTTTGC GTTGGTTGGC GCGGCAGCTG 300 

GTGGACATTC GCGAGCCGGA GACTCAGCCC GTCTGCTTCT TCCGCTGCGC GCGGCGCACA 360 

CTACAGGCGC GTGGGCTCAT TCGCACTTTC CAGCGACCAG ATCG CGTAGA GCTGATGCCG 420 

CTGCCTCCCT GGCAGCCGGT GGGCGAGAAC TTCACCCTGA GCTGTAGGGT CCCCGGCGCC 480 

GGGCCCCGTG CGAGCCTCAC GCTGACCCTG CTGCGGGGCG CCCAGGAGCT GATCCGCCGC 540 

AGCTTCGCCG GTGAACCACC CCGAGCGCGG GGCGCGGTGC TCACAGCCAC GGTACTGGCT 600 

CGGAGGGAGG ACCATGGAGC CAATTTCTCG TGTCGCGCCG AGCTGGACCT GCGGCCGCAC 660 

GGACTGGGAC TGTTTGAAAA CAGCTCGGCC CCCAGAGAGC TCCGAACCTT CTCCCTGTCT 720 

CCGGATGCCC CGCGCCTCGC TGCTCCCCGG CTCTTGGAAG TTGGCTCGGA AAGGCCCGTG 780 

AGCTGCACTC TGGACGGACT GTTTCCAGCC TCAGAGG CCA GGGTCTACCT CGCACTGGGG 840 

GAC CAGAATC TGAGTCCTGA TGTCACCCTC GAAGGGGACG CATTCGTGGC CACTGCCACA 900 
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GCCACAGCTA GCGCAGAGCA GGAGGGTGCC AGGCAGCTGG TCTGCAACGT CACCCTGGGG 9 60 

GGCGAAAACC GGGAGACCCG GGAGAACGTG ACCATCTACA GCTTCCCGGC ACCACTCCTG 1020 

ACCCTGAGCG AACCCAGCGT CTCCGAGGGG CAGATGGTGA CAGTAACCTG CGCAGCTGGG 1080 

GCCCAAGCTC TGGTCACACT GGAGGGAGTT CCAGCCGCGG TCCCGGGGCA GCCCGCCCAG 1140 

CTTCAGCTAA ATGCCACCGA GAACGACGAC AGACGCAGCT TCTTCTGCGA CGCCACCCTC 1200 

GATGTGGACG GGGAGACCCT GATCAAGAAC AGGAGCGCAG AGCTTCGTGT CCTATACGCT 1260 

CCCCGGCTAG ACGATTCGGA CTGCCCCAGG AGTTGGACGT GGCCCGAGGG CCCAGAGCAG 1320 

ACGCTGCGCT GCGAGGCCCG CGGGAACCCA GAACCCTCAG TGCACTGTGC GCGCTCCGAC 1380 

GGCGGGGCCG TGCTGGCTCT GGGCCTGCTG GGTCCAGTCA CTCGGGCGCT CTCAGGCACT 1440 

TACCGCTGCA AGGCGGCCAA TGATCAAGGC GAGGCGGTCA AGGACGTAAC GCTAACGGTG 1500 

GAGTACGCAC CAGCGCTGGA CAGCGTGGGC TGCCCAGAAC GCATTACTTG GCTGGAG 1557 
(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2927 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : cDNA 



( ix) FEATURE : 

(A) NAME /KEY: CDS 

(B) LOCATION: 4 0.. 2 814 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

CGCGCTCTCC TCGCCTCCTG TGCTTTCCCC GCCGCGGCG ATG CCA GGG CCT TCG 

Met Pro Gly Pro Ser 
1 5 

CCA GGG CTG CGC CGG GCG CTA CTC GGC CTC TGG GCT GCT CTG GGC CTG 
Pro Gly Leu Arg Arg Ala Leu Leu Gly Leu Trp Ala Ala Leu Gly Leu 
10 15 20 

GGG CTC TTC GGC CTC TCA GCG GTC TCG CAG GAG CCC TTC TGG GCG GAC 
Gly Leu Phe Gly Leu Ser Ala Val Ser Gin Glu Pro Phe Trp Ala Asp 
25 30 35 

CTG CAG CCT CGC GTG GCG TTC GTG GAG CGC GGG GGC TCG CTG TGG CTG 
Leu Gin Pro Arg Val Ala Phe Val Glu Arg Gly Gly Ser Leu Trp Leu 
40 45 50 

AAT TGC AGC ACC AAC TGC CCT CGG CCG GAG CGC GGT GGC CTG GAG ACC 
Asn Cys Ser Thr Asn Cys Pro Arg Pro Glu Arg Gly Gly Leu Glu Thr 
55 60 65 

TCG CTG CGC CGA AAC GGG ACC CAG AGG GGT TTG CGT TGG TTG GCG CGG 
Ser Leu Arg Arg Asn Gly Thr Gin Arg Gly Leu Arg Trp Leu Ala Arg 
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CAG CTG GTG GAC ATT CGC GAG CCG GAG ACT CAG CCC GTC TGC TTC TTC 
Gin Leu Val Asp lie Arg Glu Pro Glu Thr Gin Pro Val Cys Phe Phe 
90 95 100 

CGC TGC GCG CGG CGC ACA CTA CAG GCG CGT GGG CTC ATT CGC ACT TTC 
Arg Cys Ala Arg Arg Thr Leu Gin Ala Arg Gly Leu lie Arg Thr Phe 
105 110 115 

CAG CGA CCA GAT CGC GTA GAG CTG ATG CCG CTG CCT CCC TGG CAG CCG 
| Gin Arg Pro Asp Arg Val Glu Leu Met Pro Leu Pro Pro Trp Gin Pro 

9 120 125 130 

GTG GGC GAG AAC TTC ACC CTG AGC TGT AGG GTC CCC GGC GCC GGG CCC 
Val Gly Glu Asn Phe Thr Leu Ser Cys Arg Val Pro Gly Ala Gly Pro 
135 140 145 

CGT GCG AGC CTC ACG CTG ACC CTG CTG CGG GGC GCC CAG GAG CTG ATC 
Arg Ala Ser Leu Thr Leu Thr Leu Leu Arg Gly Ala Gin Glu Leu lie 
150 155 160 165 

p CGC CGC AGC TTC GCC GGT GAA CCA CCC CGA GCG CGG GGC GCG GTG CTC 
p= Arg Arg Ser Phe Ala Gly Glu Pro Pro Arg Ala Arg Gly Ala Val Leu 
p\ 170 175 ~ * 180 

|0 ACA GCC ACG GTA CTG GCT CGG AGG GAG GAC CAT GGA GCC AAT TTC TCG 
|iri Thr Ala Thr Val Leu Ala Arg Arg Glu Asp His Gly Ala Asn Phe Ser 
fu 185 190 195 

*F TGT CGC GCC GAG CTG GAC CTG CGG CCG CAC GGA CTG GGA CTG TTT GAA 
u Cys Arg Ala Glu Leu Asp Leu Arg Pro His Gly Leu Gly Leu Phe Glu 
s U 200 205 210 

jy AAC AGC TCG GCC CCC AGA GAG CTC CGA ACC TTC TCC CTG TCT CCG GAT 
M= Asn Ser Ser Ala Pro Arg Glu Leu Arg Thr Phe Ser Leu Ser Pro Asp 
JO 215 22 0 225 

J =l GCC CCG CGC CTC GCT GCT CCC CGG CTC TTG GAA GTT GGC TCG GAA AGG 
r~ Ala Pro Arg Leu Ala Ala Pro Arg Leu Leu Glu Val Gly Ser Glu Arg 
230 235 240 245 

CCC GTG AGC TGC ACT CTG GAC GGA CTG TTT CCA GCC TCA GAG GCC AGG 
Pro Val Ser Cys Thr Leu Asp Gly Leu Phe Pro Ala Ser Glu Ala Arg 
250 255 260 

GTC TAC CTC GCA CTG GGG GAC CAG AAT CTG AGT CCT GAT GTC ACC CTC 
Val Tyr Leu Ala Leu Gly Asp Gin Asn Leu Ser Pro Asp Val Thr Leu 
265 270 275 

GAA GGG GAC GCA TTC GTG GCC ACT GCC ACA GCC ACA GCT AGC GCA GAG 
Glu Gly Asp Ala Phe Val Ala Thr Ala Thr Ala Thr Ala Ser Ala Glu 
j| 280 285 290 

CAG GAG GGT GCC AGG CAG CTG GTC TGC AAC GTC ACC CTG GGG GGC GAA 
Gin Glu Gly Ala Arg Gin Leu Val Cys Asn Val Thr Leu Gly Gly Glu 
295 300 305 

AAC CGG GAG ACC CGG GAG AAC GTG ACC ATC TAC AGC TTC CCG GCA CCA 
Asn Arg Glu Thr Arg Glu Asn Val Thr He Tyr Ser Phe Pro Ala Pro 
310 315 320 325 

CTC CTG ACC CTG AGC GAA CCC AGC GTC TCC GAG GGG CAG ATG GTG ACA 
Leu Leu Thr Leu Ser Glu Pro Ser Val Ser Glu Gly Gin Met Val Thr 
330 335 340 
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GTA ACC TGC GCA GCT GGG GCC CAA GCT CTG GTC ACA CTG GAG GGA GTT 1110 
Val Thr Cys Ala Ala Gly Ala Gin Ala Leu Val Thr Leu Glu Gly Val 
345 350 355 

CCA GCC GCG GTC CCG GGG CAG CCC GCC CAG CTT CAG CTA AAT GCC ACC 1158 
Pro Ala Ala Val Pro Gly Gin Pro Ala Gin Leu Gin Leu Asn Ala Thr 
360 365 370 

GAG AAC GAC GAC AGA CGC AGC TTC TTC TGC GAC GCC ACC CTC GAT GTG 1206 
Glu Asn Asp Asp Arg Arg Ser Phe Phe Cys Asp Ala Thr Leu Asp Val 
375 380 385 

GAC GGG GAG ACC CTG ATC AAG AAC AGG AGC GCA GAG CTT CGT GTC CTA 1254 
Asp Gly Glu Thr Leu He Lys Asn Arg Ser Ala Glu Leu Arg Val Leu 
390 395 400 405 

TAC GCT CCC CGG CTA GAC GAT TCG GAC TGC CCC AGG AGT TGG ACG TGG 1302 
Tyr Ala Pro Arg Leu Asp Asp Ser Asp Cys Pro Arg Ser Trp Thr Trp 
410 415 420 

CCC GAG GGC CCA GAG CAG ACG CTG CGC TGC GAG GCC CGC GGG AAC CCA 1350 
Pro Glu Gly Pro Glu Gin Thr Leu Arg Cys Glu Ala Arg Gly Asn Pro 
425 430 435 

GAA CCC TCA GTG CAC TGT GCG CGC TCC GAC GGC GGG GCC GTG CTG GCT 1398 
Glu Pro Ser Val His Cys Ala Arg Ser Asp Gly Gly Ala Val Leu Ala 
440 445 450 

CTG GGC CTG CTG GGT CCA GTC ACT CGG GCG CTC TCA GGC ACT TAC CGC 1446 
Leu Gly Leu Leu Gly Pro Val Thr Arg Ala Leu Ser Gly Thr Tyr Arg 
455 460 465 

TGC AAG GCG GCC AAT GAT CAA GGC GAG GCG GTC AAG GAC GTA ACG CTA 1494 
Cys Lys Ala Ala Asn Asp Gin Gly Glu Ala Val Lys Asp Val Thr Leu 
470 475 480 485 

ACG GTG GAG TAC GCA CCA GCG CTG GAC AGC GTG GGC TGC CCA GAA CGC 1542 
Thr Val Glu Tyr Ala Pro Ala Leu Asp Ser Val Gly Cys Pro Glu Arg 
490 495 500 

ATT ACT TGG CTG GAG GGA ACA GAA GCC TCG CTG AGC TGT GTG GCG CAC 1590 
He Thr Trp Leu Glu Gly Thr Glu Ala Ser Leu Ser Cys Val Ala His 
505 510 515 

GGG GTA CCG CCG CCT GAT GTG ATC TGC GTG CGC TCT GGA GAA CTC GGG 1638 
Gly Val Pro Pro Pro Asp Val He Cys Val Arg Ser Gly Glu Leu Gly 
520 525 530 

GCC GTC ATC GAG GGG CTG TTG CGT GTG GCC CGG GAG CAT GCG GGC ACT 1686 
Ala Val lie Glu Gly Leu Leu Arg Val Ala Arg Glu His Ala Gly Thr 
535 540 545 

TAC CGC TGC GAA GCC ACC AAC CCT CGG GGC TCT GCG GCC AAA AAT GTG 1734 
Tyr Arg Cys Glu Ala Thr Asn Pro Arg Gly Ser Ala Ala Lys Asn Val 
550 555 560 565 

GCC GTC ACG GTG GAA TAT GGC CCC AGG TTT GAG GAG CCG AGC TGC CCC 1782 
Ala Val Thr Val Glu Tyr Gly Pro Arg Phe Glu Glu Pro Ser Cys Pro 
570 575 580 

AGC AAT TGG ACA TGG GTG GAA GGA TCT GGG CGC CTG TTT TCC 'TGT GAG 1830 
Ser Asn Trp Thr Trp Val Glu Gly Ser Gly Arg Leu Phe Ser Cys Glu 
585 590 595 
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GTC GAT GGG AAG CCA CAG CCA AGC GTG AAG TGC GTG GGC TCC GGG GGC 
Val Asp Gly Lys Pro Gin Pro Ser Val Lys Cys Val Gly Ser Gly Gly 
600 605 610 

ACC ACT GAG GGG GTG CTG CTG CCG CTG GCA CCC CCA GAC CCT AGT CCC 
Thr Thr Glu Gly Val Leu Leu Pro Leu Ala Pro Pro Asp Pro Ser Pro 
615 620 625 

AGA GCT CCC AGA ATC CCT AGA GTC CTG GCA CCC GGT ATC TAC GTC TGC 
Arg Ala Pro Arg lie Pro Arg Val Leu Ala Pro Gly lie Tyr Val Cys 
630 635 640 645 

AAC GCC ACC AAC CGC CAC GGC TCC GTG GCC AAA ACA GTC GTC GTG AGC 
Asn Ala Thr Asn Arg His Gly Ser Val Ala Lys Thr Val Val Val Ser 
650 655 660 

GCG GAG TCG CCA CCG GAG ATG GAT GAA TCT ACC TGC CCA AGT CAC CAG 
Ala Glu Ser Pro Pro Glu Met Asp Glu Ser Thr Cys Pro Ser His Gin 
665 670 675 

ACG TGG CTG GAA GGG GCT GAG GCT TCC GCG CTG GCC TGC GCC GCC CGG 
Thr Trp Leu Glu Gly Ala Glu Ala Ser Ala Leu Ala Cys Ala Ala Arg 
680 685 690 

GGT CGC CCT TCC CCA GGA GTG CGC TGC TCT CGG GAA GGC ATC CCA TGG 
Gly Arg Pro Ser Pro Gly Val Arg Cys Ser Arg Glu Gly lie Pro Trp 
695 700 705 

CCT GAG CAG CAG CGC GTG TCC CGA GAG GAC GCG GGC ACT TAC CAC TGT 
Pro Glu Gin Gin Arg Val Ser Arg Glu Asp Ala Gly Thr Tyr His Cys 
710 715 720 725 

GTG GCC ACC AAT GCG CAT GGC ACG GAC TCC CGG ACC GTC ACT GTG GGC 
Val Ala Thr Asn Ala His Gly Thr Asp Ser Arg Thr Val Thr Val Gly 
730 735 740 

GTG GAA TAC CGG CCA GTG GTG GCC GAA CTT GCT GCC TCG CCC CCT GGA 
Val Glu Tyr Arg Pro Val Val Ala Glu Leu Ala Ala Ser Pro Pro Gly 
745 750 755 

GGC GTG CGC CCA GGA GGA AAC TTC ACG TTG ACC TGC CGC GCG GAG GCC 
Gly Val Arg Pro Gly Gly Asn Phe Thr Leu Thr Cys Arg Ala Glu Ala 
760 765 770 

TGG CCT CCA GCC CAG ATC AGC TGG CGC GCG CCC CCG AGG GCC CTC AAC 
Trp Pro Pro Ala Gin lie Ser Trp Arg Ala Pro Pro Arg Ala Leu Asn 
775 780 785 

ATC GGC CTG TCG AGC AAC AAC AGC ACA CTG AGC GTG GCA GGC GCC ATG 
He Gly Leu Ser Ser Asn Asn Ser Thr Leu Ser Val Ala Gly Ala Met 
790 795 800 805 

GGA AGC CAC GGC GGC GAG TAC GAG TGC GCA CGC ACC AAC GCG CAC GGG 
Gly Ser His Gly Gly Glu Tyr Glu Cys Ala Arg Thr Asn Ala His Gly 
810 815 820 

CGC CAC GCG CGG CGC ATC ACG GTG CGC GTG GCC GGT CCG TGG CTA TGG 
Arg His Ala Arg Arg He Thr Val Arg Val Ala Gly Pro Trp Leu Trp 
825 830 835 

GTC GCC GTG GGC GGC GCG GCG GGG GGC GCG GCG CTG CTG GCC GCG GGG 
Val Ala Val Gly Gly Ala Ala Gly Gly Ala Ala Leu Leu Ala Ala Gly 
840 845 850 
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GCC GGC CTG GCC TTC TAC GTG CAG TCC ACC GCC TGC AAG AAG GGC GAG 2 646 

Ala Gly Leu Ala Phe Tyr Val Gin Ser Thr Ala Cys Lys Lys Gly Glu 
855 860 865 

TAC AAC GTG CAG GAG GCC GAG AGC TCA GGC GAG GCC GTG TGT CTG AAC 2 694 

Tyr Asn Val Gin Glu Ala Glu Ser Ser Gly Glu Ala Val Cys Leu Asn 
870 875 880 885 

GGA GCG GGC GGC GGC GCT GGC GGG GCG GCA GGC GCG GAG GGC GGA CCC 2 742 

Gly Ala Gly Gly Gly Ala Gly Gly Ala Ala Gly Ala Glu Gly Gly Pro 
890 895 900 

GAG GCG GCG GGG GGC GCG GCC GAG TCG CCG GCG GAG GGC GAG GTC TTC 2790 
Glu Ala Ala Gly Gly Ala Ala Glu Ser Pro Ala Glu Gly Glu Val Phe 
905 910 915 

GCC ATA CAG CTG ACA TCG GCG TGAGCCCGCT CCCCTCTCCG CGGGCCGGGA 2 841 

Ala lie Gin Leu Thr Ser Ala 

920 925 

CGCCCCCCAG ACTCACACGG GGGCTTATTT ATTGCTTTAT TTATTTACTT ATTCATTTAT 2901 

TTATGTATTC AACTCCAAGG GAATTC 2927 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 924 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Pro Gly Pro Ser Pro Gly Leu Arg Arg Ala Leu Leu Gly Leu Trp 
15 10 15 

Ala Ala Leu Gly Leu Gly Leu Phe Gly Leu Ser Ala Val Ser Gin Glu 
20 25 30 

Pro Phe Trp Ala Asp Leu Gin Pro Arg Val Ala Phe Val Glu Arg Gly 
35 ~ 40 45 

Gly Ser Leu Trp Leu Asn Cys Ser Thr Asn Cys Pro Arg Pro Glu Arg 
50 55 60 

Gly Gly Leu Glu Thr Ser Leu Arg Arg Asn Gly Thr Gin Arg Gly Leu 
65 70 75 80 

Arg Trp Leu Ala Arg Gin Leu Val Asp lie Arg Glu Pro Glu Thr Gin 
85 90 95 

Pro Val Cys Phe Phe Arg Cys Ala Arg Arg Thr Leu Gin Ala Arg Gly 
100 " 105 " 110 

Leu lie Arg Thr Phe Gin Arg Pro Asp Arg Val Glu Leu Met Pro Leu 
115 120 125 

Pro Pro Trp Gin Pro Val Gly Glu Asn Phe Thr Leu Ser Cys Arg Val 
130 135 140 



Pro Gly Ala Gly Pro Arg Ala Ser Leu Thr Leu Thr Leu Leu Arg Gly 
145 150 155 160 

Ala Gin Glu Leu lie Arg Arg Ser Phe Ala Gly Glu Pro Pro Arg Ala 
165 170 175 

Arg Gly Ala Val Leu Thr Ala Thr Val Leu Ala Arg Arg Glu Asp His 
180 185 190 

Gly Ala Asn Phe Ser Cys Arg Ala Glu Leu Asp Leu Arg Pro His Gly 
195 200 205 

Leu Gly Leu Phe Glu Asn Ser Ser Ala Pro Arg Glu Leu Arg Thr Phe 
210 215 220 

Ser Leu Ser Pro Asp Ala Pro Arg Leu Ala Ala Pro Arg Leu Leu Glu 
225 230 235 240 

Val Gly Ser Glu Arg Pro Val Ser Cys Thr Leu Asp Gly Leu Phe Pro 
245 250 ^ 255 

Ala Ser Glu Ala Arg Val Tyr Leu Ala Leu Gly Asp Gin Asn Leu Ser 
260 265 270 

Pro Asp Val Thr Leu Glu Gly Asp Ala Phe Val Ala Thr Ala Thr Ala 
275 280 285 

Thr Ala Ser Ala Glu Gin Glu Gly Ala Arg Gin Leu Val Cys Asn Val 
290 295 300 

Thr Leu Gly Gly Glu Asn Arg Glu Thr Arg Glu Asn Val Thr lie Tyr 
305 310 315 320 

Ser Phe Pro Ala Pro Leu Leu Thr Leu Ser Glu Pro Ser Val Ser Glu 
325 330 335 

Gly Gin Met Val Thr Val Thr Cys Ala Ala Gly Ala Gin Ala Leu Val 
340 345 350 

Thr Leu Glu Gly Val Pro Ala Ala Val Pro Gly Gin Pro Ala Gin Leu 
355 360 365 

Gin Leu Asn Ala Thr Glu Asn Asp Asp Arg Arg Ser Phe Phe Cys Asp 
370 375 380 

Ala Thr Leu Asp Val Asp Gly Glu Thr Leu lie Lys Asn Arg Ser Ala 
385 390 395 400 

Glu Leu Arg Val Leu Tyr Ala Pro Arg Leu Asp Asp Ser Asp Cys Pro 
405 410 415 

Arg Ser Trp Thr Trp Pro Glu Gly Pro Glu Gin Thr Leu Arg Cys Glu 
420 425 430 

Ala Arg Gly Asn Pro Glu Pro Ser Val His Cys Ala Arg Ser Asp Glv 
435 440 445 

Gly Ala Val Leu Ala Leu Gly Leu Leu Gly Pro Val Thr Arg Ala Leu 
450 455 460 

Ser Gly Thr Tyr Arg Cys Lys Ala Ala Asn Asp Gin Gly Glu Ala Val 
465 470 475 " 480 



Lys Asp Val Thr Leu Thr Val Glu Tyr Ala Pro Ala Leu Asp Ser Val 
485 490 495 

Gly Cys Pro Glu Arg lie Thr Trp Leu Glu Gly Thr Glu Ala Ser Leu 
500 505 ~ 510 

Ser Cys Val Ala His Gly Val Pro Pro Pro Asp Val lie Cys Val Arg 
515 520 525 

Ser Gly Glu Leu Gly Ala Val lie Glu Gly Leu Leu Arg Val Ala Arg 
530 535 540 

Glu His Ala Gly Thr Tyr Arg Cys Glu Ala Thr Asn Pro Arg Gly Ser 
545 550 555 560 

Ala Ala Lys Asn Val Ala Val Thr Val Glu Tyr Gly Pro Arg Phe Glu 
565 570 575 

Glu Pro Ser Cys Pro Ser Asn Trp Thr Trp Val Glu Gly Ser Gly Arg 
580 585 590 

Leu Phe Ser Cys Glu Val Asp Gly Lys Pro Gin Pro Ser Val Lys Cys 
595 600 605 

Val Gly Ser Gly Gly Thr Thr Glu Gly Val Leu Leu Pro Leu Ala Pro 
610 615 620 

Pro Asp Pro Ser Pro Arg Ala Pro Arg lie Pro Arg Val Leu Ala Pro 
625 630 635 640 

Gly lie Tyr Val Cys Asn Ala Thr Asn Arg His Gly Ser Val Ala Lys 
645 650 ' 655 

Thr Val Val Val Ser Ala Glu Ser Pro Pro Glu Met Asp Glu Ser Thr 
660 665 670 

Cys Pro Ser His Gin Thr Trp Leu Glu Gly Ala Glu Ala Ser Ala Leu 
675 680 685 

Ala Cys Ala Ala Arg Gly Arg Pro Ser Pro Gly Val Arg Cys Ser Arg 
690 695 700 

Glu Gly He Pro Trp Pro Glu Gin Gin Arg Val Ser Arg Glu Asp Ala 
705 710 715 720 

Gly Thr Tyr His Cys Val Ala Thr Asn Ala His Gly Thr Asp Ser Arg 
725 730 ' 735 

Thr Val Thr Val Gly Val Glu Tyr Arg Pro Val Val Ala Glu Leu Ala 
740 745 750 

Ala Ser Pro Pro Gly Gly Val Arg Pro Gly Gly Asn Phe Thr Leu Thr 
755 760 ' 765 

Cys Arg Ala Glu Ala Trp Pro Pro Ala Gin He Ser Trp Arg Ala Pro 
770 775 780 

Pro Arg Ala Leu Asn He Gly Leu Ser Ser Asn Asn Ser Thr Leu Ser 
785 790 795 800 

Val Ala Gly Ala Met Gly Ser His Gly Gly Glu Tyr Glu Cys Ala Arg 
805 810 " 815 



Thr Asn Ala His Gly Arg His Ala Arg Arg lie Thr Val Arg Val Ala 
820 825 830 

Gly Pro Trp Leu Trp Val Ala Val Gly Gly Ala Ala Gly Gly Ala Ala 
835 840 845 

Leu Leu Ala Ala Gly Ala Gly Leu Ala Phe Tyr Val Gin Ser Thr Ala 
850 855 8S0 

Cys Lys Lys Gly Glu Tyr Asn Val Gin Glu Ala Glu Ser Ser Gly Glu 
865 870 875 880 

Ala Val Cys Leu Asn Gly Ala Gly Gly Gly Ala Gly Gly Ala Ala Gly 
885 890 895 

Ala Glu Gly Gly Pro Glu Ala Ala Gly Gly Ala Ala Glu Ser Pro Ala 
900 905 910 

Glu Gly Glu Val Phe Ala lie Gin Leu Thr Ser Ala 
915 920 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
G TACTTACAG GATCCGCGGT CTCGCAGGAG CCCTTCTGGG CGGACCTACA GCCTGCGTGG 
CGTTC 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 
ATTTCTCTCG AGGATGGTCA CGTTCTCCCG G 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 
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ATTTCTGGAT CCTACAGCTT CCCGGCACCA CTC 33 
(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
ATTTCTCTCG AGTTCCACGC CCACAGTGAC GG 32 
(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1687 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

GGATCCTTTG AG C C CTGAAA GTCGAGGTTG CAGTGAGCCT TGATCGTGCC ACTGCACTCC 60 

AGCCTGGGGG ACAGAGCACG ACCCTGTCTC CAAAAATAAA ATAAAAATAA AAATAAATAT 120 

TGGCGGGGGA ACCCTCTGGA ATCAATAAAG GCTTCCTTAA CCAGCCTCTG TCCTGTGACC 180 

TAAGGGTCCG' CATTACTGCC CTTCTTCGGA GGAACTGGTT TGTTTTTGTT GTTGTTGTTG 24 0 

TTTTTGCGAT CACTTTCTCC AAGTTCCTTG TCTCCCTGAG GGCACCTGAG GTTTCCTCAC 300 

TCAGGGCCCA CCTGGGGTCC CGAAGCCCCA GACTCTGTGT ATCCCCAGCG GGTGTCACAG 3 60 

AAACCTCTCC TTCTGCTGGC CTTATCGAGT GGGATCAGCG CGGCCGGGGA GAGCCACGGG 420 

CAGGGGCGGG GTGGGGTTCA TGGTATGGCT TTC CTGATTG GCGCCGCCGC CACCACGCGG 480 

CAGCTCTGAT TGGATGTTAA GTTTCCTATC CCAGCCCCAC CTTCAGACCC TGTGCTTTCC 540 

TGGAGGCCAA ACAACTGTGG AG CGAGAACT CATCTCCAAA ATAACTTACC ACGCTGGAGT 600 

GAGACCACGA ATGGTGGGGA GGGGAGGGTC CCACGGACAT ATTGAGGGAC GTGGATACGC 660 

AGAAGAGGTA TCCATGTGGT GGCAGCCGGG AAGGGGTGAT CAGATGGTCC ACAGGGAATA 720 

TCACAAACTC GAATTCTGAC GATGTTCTGG TAGTCACCCA GCCAGATGAG CGCATGGAGT 780 

TGGCGGTGGG GGGTGTCAAA GCTTGGGGCC CGGAAGCGGA GTCAAAAGCA TCACCCTCGG 840 

TCCCTTGTTC TCGCGTGGAT GTCAGGGCCT CCACCCACCG AGCAGAAGGC GGACTCAGGG 900 

GCGCTCCAGG GTGGCTCGAG CTCACACACG CTGAGTAGAC ACGTGCCCGC TGCACCCTGG 960 

GTAAATACAG ACCCGGAGCC GAGCGGATTC TAATTTAGAC GCCCGCGAAC GCTGCGCGCA 1020 

CGCACACGTG TCCTCGGCTC GCTGGCACTT TCGTCCCGCC CCCTCCGTCG CGTGCCGGAG 1080 
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CTGACC CGGA 


GGGGTGCTTA 


GAGGTATGGC 


TCCGCGGGGT 


CAAAAGGAGA AGGATCAGTG 


1140 


AGAGAGGATC 


CCCACACCCT 


CCCCTAGAAC 


TGTCCTTTCC 


CCATCCAGTG 


CCTCCCAAAT 


1200 


CTCTCTTAGT 


CCCCAAATGT 


ATCCCCGCCC 


TAAGGGGCGC 


TGGTGGGAGG 


AGCTAAATGT 


1260 


GGGGGCGGAG 


CTCGGAGTCC 


AGCTTATTAT 


GATGGCATCT 


CAGCCAGGGC 


TGGGGTAGGG 


1320 


GTTTGGGAAG 


GGCAACCCAG 


CATCCCCCGA 


TCCCAGAGTC 


GCGGCCGGGG 


ATGACGCGAG 


1380 


AGAGCGTGGT 


CGCCCCCAGA 


AGGCCCTGGG 


CCATCATGCC 


GGCCTCCACG 


TAGACCCCAG 


1440 


GGGTCGCTCA 


CTCCTGCCAG 


CTCGCCTTCA 


CCAAGGCCAG 


GAGCTTAGCG 


CACGCTCGCC 


1500 


TCCCGCCCCC 


CCGCCGCCTC 


TGCCGCCGCC 


CCCTCCTTGG AAACCAAGTT AC CAACGTTA 


1560 


AACCAATCCC 


CAAGCGCAAC 


TCTGTCTCCC 


CCACACCCCA 


CCCGCCGCGC 


CGCGCGGAGC 


1620 


CGTCCTCTAG 


CCCAGCTCCT 


CGGCTCGCGC 


TCTCCTCGCC 


TCCTGTGCTT 


TCCCCGCCGC 


1680 


GGCGATG 












1687 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 6 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 
CAGAACTAAG CTTACAGGAG GCGAGGAGAG CGCGAG 3 6 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CAACAATGCT AGCCAAGCGC AACTCTGTCT C 31 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
CAACAATGCT AGCCTTGGAA AC CAAGTTAC C 31 
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(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 
CAACAATGCT AGCAGGAGCT TAGCGCACGC TCG 
(2) INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 
CAACAATGCT AGCCATGCCG GCCTCCACGT AG 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 
CAACAATGCT AGCGTCCAGC TTATTATCAT G 
(2) INFORMATION FOR SEQ ID NO:40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40 
CAACAATGCT AGCCTTAGTC CCCAAATGTA TC 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 
CAACAATGCT AGCGGAGAAG GATCAGTGAG 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 
CAACAATGCT AGCCTCCACC CACCGAGCAG AAG 



